SUBUNIT VACCINES WITH A2 SUPERMOTIFS 



Cross-Reference to Related Applications 

[01] This application is a continuation-in-part of U.S. Application Serial No. 
09/346,105, entitled "Consistent Immune Responses in Diverse Genetic Populations " 
5 filed 30 June 1999, Sidney et ah This application also claims the benefit of the 29 
January 2001 filing date of U.S. Application Serial No. 60/264,969, entitled "Subunit 
Vaccines with A2 Supermotifs Sidney et al y each of which is incorporated by reference 
in its entirety. 

Federally Sponsored Research And Development 

10 [02] Subject matter disclosed herein was funded, in part, by the United States 

government under grants from the National Institutes of Health. The U.S. government 
may have certain rights in this invention. 

Technical Field 

[03] Subject matter disclosed herein relates to the design of vaccines which 
1 5 will be effective in large portions of the population, in particular, those members of the 
population who are characterized as having an A2 supertype allele. Subunit vaccines 
which comprise the A2 supeimotif can be designed to effect such population coverage. 

Background 

[04] The genetic makeup of a given mammal encodes the structures associated 
20 with the immune system of that species. Although there is a great deal of genetic 

diversity in the human population, even more so comparing humans and other species, 
there are also common features and effects. In mammals, certain molecules associated 
with immune function are termed the major histocompatibility complex. 

[05] MHC molecules are classified as either Class I or Class II molecules. 
25 Class II MHC molecules are expressed primarily on cells involved in initiating and 

sustaining immune responses, such as T lymphocytes, B lymphocytes, macrophages, etc. 

1 



Class II MHC molecules are recognized by helper T lymphocytes and induce 
proliferation of helper T lymphocytes and amplification of the immune response to the 
particular immunogenic peptide that is displayed. Class I MHC molecules are expressed 
on almost all nucleated cells and are recognized by cytotoxic T lymphocytes (CTLs), 
5 which then destroy the antigen-bearing cells. CTLs are particularly important in tumor 
rejection and in fighting viral infections. 

[06] CTLs recognize the antigen in the form of a peptide fragment bound to the 
MHC class I molecules rather than the intact foreign antigen itself. The antigen must 
normally be endogenously synthesized by the cell, and a portion of the protein antigen is 

1 0 degraded into small peptide fragments in the cytoplasm. Some of these small peptides 
translocate into a pre-Golgi compartment and interact with class I heavy chains to 
facilitate proper folding and association with the subunit /32 microglobulin. The peptide- 
MHC class I complex is then routed to the cell surface for expression and potential 
recognition by specific CTLs. 

1 5 [07] Investigations of the crystal structure of the human MHC class I molecule, 

HLA-A2.1, indicate that a peptide binding groove is created by the folding of the al and 
o2 domains of the class I heavy chain (Bjorkman, et al, Nature 329:506 ( 1987)). In 
these investigations, however, the identity of peptides bound to the groove was not 
determined. 

20 [08] Buus, et al, Science 242:1065 (1988) first described a method for acid 

elution of bound peptides from MHC. Subsequently, Rammensee and his coworkers 
(Falk, et al, Nature 351:290 (1991)) have developed an approach to characterize 
naturally processed peptides bound to class I molecules. Other investigators have 
successfully achieved direct amino acid sequencing of the more abundant peptides in 

25 various HPLC fractions by conventional automated sequencing of peptides eluted from 
class I molecules of the B type (Jardetzky, et al, Nature 353:326 (1991)) and of the A2.1 
type by mass spectrometry (Hunt, et ah, Science 225:1261 (1992)). A review of the 
characterization of naturally processed peptides in MHC Class I has been presented by 
Rotzschke & Falk (Rotzschke & Falk, Immunol Today 12:447 (1991)). PCT publication 

30 WO 97/34621, incorporated herein by reference, describes peptides which have a binding 
motif for A2.1 alleles. 
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[09] Sette, et ai 9 Proa Natl Acad ScL USA 86:3296 (1989) showed that 
MHC allele specific motifs could be used to predict MHC binding capacity. Schaeffer, et 
ah, Proa Nat 1 Acad. ScL USA 86:4649 (1989) showed that MHC binding was related to 
immunogenicity. Others (De Bruijn, et al, Eur. J. Immunol, 21:2963-2970 (1991); 
5 Pamer, et aL, 991 Nature 353:852-955 (1991)) have provided preliminary evidence that 
class I binding motifs can be applied to the identification of potential immunogenic 
peptides in animal models. Class I motifs specific for a number of human alleles of a 
given class I isotype have yet to be described. It is desirable that the combined 
frequencies of these different alleles should be high enough to cover a large fraction or 
1 0 perhaps the majority of the human outbreed population. 

[10] Despite the developments in the art, the prior art has yet to provide a 
useful human peptide-based vaccine or therapeutic agent based on this work. 

Summary 

[11] The invention provides the parameters for the design of vaccines which 

15 are expected to effectively target large portions of the population. Following the 
guidance set forth herein, to prepare vaccines with respect to a particular infectious 
organism or virus or tumor, the relevant antigen is assessed to determine the location of 
epitopes which are most likely to effect a cytotoxic T response to an infection or tumor. 
By analyzing the amino acid sequence of the antigen according to the methods set forth 

20 herein, an appropriate set of epitopes can be identified. Peptides which consist of these 
epitopes can readily be tested for their ability to bind one or more than one HLA 
characteristic of the A2 supertype. In general, peptides which bind with an affinity 
represented by an IC 5 o of 500 nM or less have a high probability of eliciting a cytotoxic T 
lymphocyte (CTL) response. The ability of these peptides to do so can also readily be 

25 verified. Vaccines can then be designed based on the immunogenic peptides thus 

identified. The vaccines themselves can consist of the peptides per se, precursors which 
will be expected to generate the peptides in vivo, or nucleic acids encoding these peptides 
for production in vivo. 

[12] Thus, in one aspect, the invention is directed to a method for identifying 

30 an epitope in an antigen characteristic of a pathogen or of a tumor which epitope is more 
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likely to enhance an immune response in an individual bearing an allele of the A2 
supertype than an arbitrarily chosen peptide, which method comprises analyzing the 
amino acid sequence of the antigen for segments of 8-1 1 amino acids where the segment 
at position 2 is a small or aliphatic hydrophobic residue (A, I, V, L, M or T) and the 
5 amino acid at the C-terminus of the segment is also a small or aliphatic hydrophobic 
residue selected from the same group. In preferred embodiments, the residue at 
position 2 is L or M. In other preferred embodiments, the segment contains 9-10 amino 
acids. In another preferred embodiment, the segment contains Q or N at position 1 and/or 
R, H or K at position 8, and lacks a D, E and G at position 3 when the segment is a 

10 10-mer. Also preferred are V at position 2 and at the C-terminus. 

[13] Also described herein are compositions comprising immunogenic peptides 
having binding motif subsequences for HLA-A2.1 molecules. The immunogenic 
epitopes in the peptides, which bind to the appropriate MHC allele, are preferably 8-11 
residues in length and more preferably 9 to 10 residues in length and comprise conserved 

15 residues at certain positions such as positions 2 and the C-terminus. Moreover, the 

peptides do not comprise negative binding residues as defined herein at other positions 
such as positions 1, 3, 6 and/or 7 in the case of peptides 9 amino acids in length and 
positions 1, 3, 4, 5, 7, 8 and/or 9 in the case of peptides 10 amino acids in length. The 
present invention defines positions within a motif enabling the selection of peptides 

20 which will bind efficiently to HLA A2. 1 . 

[14] Epitopes on a number of immunogenic target proteins can be identified 
using the sequence motifs described herein. Examples of suitable antigens include 
prostate cancer specific antigen (PSA), hepatitis B core and surface antigens (HBVc, 
HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human immunodeficiency type- 

25 1 virus (HIV1), Kaposi's sarcoma herpes virus (KSHV), human papilloma virus (HPV) 
antigens, Lassa virus, mycobacterium tuberculosis (MT), p53, CEA, trypanosome surface 
antigen (TSA) and Her2/neu. The peptides and nucleic acids encoding them are useful in 
pharmaceutical compositions for both in vivo and ex vivo therapeutic and diagnostic 
applications. 
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Definitions 

[15] The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
5 of adjacent amino acids. The oligopeptides are generally less than 250 amino acids in 
length, and can be less than 150, 100, 75, 50, 25, or 15 amino acids in length. Further, an 
oligopeptide of the invention can be such that it does not comprise more than 15 
contiguous amino acids of a native antigen. 

[16] The nomenclature used to describe peptide compounds follows the 

10 conventional practice where the amino group is presented to the left (the N-terminus) and 
the carboxyl group to the right (the C-terminus) of each amino acid residue. In the 
formulae representing selected specific embodiments of the present invention, the amino- 
and carboxyl-terminal groups, although not specifically shown, are in the form they 
would assume at physiologic pH values, unless otherwise specified. In the amino acid 

15 structure formulae, each residue is generally represented by standard three letter or single 
letter designations. The L-form of an amino acid residue is represented by a capital 
single letter or a capital first letter of a three-letter symbol, and the D-form for those 
amino acids having D-forms is represented by a lower case single letter or a lower case 
three letter symbol. Glycine has no asymmetric carbon atom and is simply referred to as 

20 "Gly"orG. 

[17] An "immunogenic peptide" or "epitope" is a peptide or amino acid 
sequence which comprises an allele-specific motif such that the peptide sequence will 
bind an MHC molecule and induce a CTL response. Immunogenic peptides of the 
invention are capable of binding to an appropriate HLA-A2 molecule and inducing a 

25 cytotoxic T-cell response against the antigen from which the immunogenic peptide is 
derived. The immunogenic peptides of the invention are less than about 15 residues in 
length, often less than 12 residues in length and usually consist of between about 8 and 
about 1 1 residues, preferably 9 or 10 residues. 

[ 1 8] Immunogenic peptides are conveniently identified using the algorithms of 

30 the invention. The algorithms are mathematical procedures that produce a score which 
enables the selection of immunogenic peptides. Typically one uses the algorithmic score 



with a "binding threshold" to enable selection of peptides that have a high probability of 
binding at a certain affinity and will in turn be immunogenic. The algorithm is based 
upon either the effects on MHC binding of a particular amino acid at a particular position 
of a peptide or the effects on binding of a particular substitution in a motif containing 
5 peptide. 

[19] Binding results are often expressed in terms of "ICso's." IC50 is the 
concentration of peptide in a binding assay at which 50% inhibition of binding of a 
reference peptide is observed. Given the conditions in which the assays as described 
herein are run (i.e., limiting HLA proteins and labeled peptide concentrations), these 

10 values approximate K D values. Assays for determining binding are described in detail in 
PCT publications WO 94/20127 and WO 94/03205. It should be noted that IC50 values 
can change, often dramatically, if the assay conditions are varied, and depending on the 
particular reagents used (e.g., HLA preparation, etc.). For example, excessive 
concentrations of HLA molecules will increase the apparent measured IC50 of a given 

1 5 ligand and therefore not reflect the true Kd value. 

[20] Binding is often expressed as a ratio relative to a reference peptide. As a 
particular assay becomes more, or less, sensitive, the ICso's of the peptides tested may 
change somewhat. However, the binding relative to the reference peptide will not 
significantly change. For example, in an assay run under conditions such that the IC50 of 

20 the reference peptide increases 10-fold, the IC50 values of the test peptides will also shift 
approximately 10-fold. Therefore, to avoid ambiguities, the assessment of whether a 
peptide is a good, intermediate, weak, or negative binder is generally based on its IC50, 
relative to the IC50 of a standard peptide. The binding may be reported as a ratio or the 
ratio may be used to normalize the IC50 value as described in Example L 

25 [21] As used herein, high affinity with respect to HLA class I molecules is 

defined as binding with an IC50 or Kd value of less than 50 nM. Intermediate affinity is 
binding with an IC50 (or K D ) of between about 50 and about 500 nM. 

[22] A "conserved residue" is an amino acid which occurs in a significantly 
higher frequency than would be expected by random distribution at a particular position 

30 in a peptide. Typically a conserved residue is one where the MHC structure may provide 
a contact point with the immunogenic peptide. One to three, preferably two, conserved 



residues within a peptide of defined length defines a motif for an immunogenic peptide. 
These residues are typically in close contact with the peptide binding groove, with their 
side chains buried in specific pockets of the groove itself. Typically, an immunogenic 
peptide will comprise up to three conserved residues, more usually two conserved 
5 residues. 

[23] As used herein, "negative binding residues" are amino acids which if 
present at certain positions (for example, positions 1, 3 and/or 7 of a 9-mer) will result in 
a peptide being a nonbinder or poor binder and in turn fail to be immunogenic i.e. induce 
a CTL response. 

10 [24] The term "motif 1 refers to the pattern of residues in a peptide of defined 

length, usually about 8 to about 1 1 amino acids, which is recognized by a particular MHC 
allele. The peptide motifs are typically different for each human MHC allele and differ in 
the pattern of the highly conserved residues and negative residues. 

[25] The binding motif for an allele can be defined with increasing degrees of 

1 5 precision. In one case, all of the conserved residues are present in the correct positions in 
a peptide and there are no negative residues in positions 1,3 and/or 7. 

[26] A "supermotif" is a peptide binding specificity shared by HLA molecules 
encoded by two or more HLA alleles. A supermotif-bearing epitope preferably is 
recognized with high or intermediate affinity (as defined herein) by two or more HLA 

20 antigens. 

[27] An "HLA supertype or family", as used herein, describes sets of HLA 
molecules grouped on the basis of shared peptide-binding specificities. HLA class I 
molecules that share somewhat similar binding affinity for peptides bearing certain amino 
acid motifs are grouped into HLA supertypes. The terms HLA superfamily, HLA 

25 supertype family, and HLA xx-like supertype molecules (where xx denotes a particular 
HLA type) are synonyms. 

[28] The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found 
in its native state. Thus, the peptides of this invention do not contain materials normally 

30 associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
cells. Even where a protein has been isolated to a homogenous or dominant band, there 



are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

[29] The term "residue" refers to an amino acid or amino acid mimetic 
5 incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

Brief Description of the Drawings 

Figure 1 . Position 2 and C-terminus fine specificity of HLA-A*0201. The 
preference for specific residues in position 2(a) or at the C-terminus (b) is shown at a 
function of the percent of peptides bearing a specific residue that bind A*0201 with IC50 

10 of 500 nM or better. ARB values of peptides bearing specific residues in position 2 (a) or 
at the C-terminus (b) were calculated as described herein, and indexed relative to the 
residue with the highest binding capacity. The average (geometric) binding capacity of 
peptides with L in position 2 was 1991 nM. The average (geometric) binding capacity of 
peptides with V at the C-terminus was 2133 nM. Peptides included in the analysis had at 

1 5 least one tolerated anchor residue, as described in the text, at either position 2 or the C- 
terminus. 

Figure 2 . Map of the A*0201 motif. Summary map of the A*0201 motif for 8- 
mer (b), 10-mer (c) and 1 1-mer (d) peptides. At secondary anchor positions, residues 
shown as preferred (or deleterious) are associated with an average binding capacity at 

20 least 3-fold greater than (or 3 -fold less than) peptides of the same size carrying other 
residues at the same position. At the primary anchor positions, preferred residues are 
those associated with an average binding capacity within 10-fold of the optimal residue at 
the same position. Tolerated primary anchor residues are those associated with an 
average binding capacity between 10- and 100-fold of the optimal residue at the same 

25 position. 

Figure 3 . Position 2 fine specificity of HLA-A2-supertype molecules. ARB 
values of peptides bearing specific residues in position 2 were calculated for each A2- 
supertype molecule as described in the text, and indexed relative to the residue with the 
highest ARB for each specific molecule. The average (geometric) binding capacity of the 
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peptides bearing the residue with the highest ARB were 55, 59, 89, and 41nM for 
A*0202, A*0206, and A*6802, respectively. 

Figure 4 . C-terminal fine specificity of HLA-A2-supertype molecules. ARB 
values of peptides bearing specific residues at the C-terminus were calculated for each 
5 A2-supertype molecule as described in the text, and indexed relative to the residue with 
the highest ARB for each specific molecule. The average (geometric) binding capacity of 
the peptides bearing the residue with the highest ARB were 291, 48, 250, and 553 nM for 
A*0202 5 A*0203, A*0206, and A*6802, respectively. 

Figure 5 . Map of the A*0202 motif. Summary map of A*0202 motif for 9-mer 

10 (a) and 10-mer (b) peptides. At secondary anchor positions, residues shown as preferred 
(or deleterious) are associated with an average binding capacity at least 3-fold greater 
than (or 3-fold less than) peptides of the same size carrying other residues at the same 
position. At the primary anchor positions, preferred residues are those associated with an 
average binding capacity within 10-fold of the optimal residue at the same position. 

15 Tolerated primary anchor residues are those associated with an average binding capacity 
between 10- and 100-fold of the optimal residue at the same position. 

Figure 6 . Map of the A*0203 motif. Summary maps of A*0203 motif for 9-mer 
(a) and 10-mer (b) peptides. At secondary anchor positions, residues shown as preferred 
(or deleterious) are associated with an average binding capacity at least 3-fold greater 

20 than (or 3-fold less than) peptides of the same size carrying other residues at the same 

position. At the primary anchor positions, preferred residues are those associated with an 
average binding capacity within 10-fold of the optimal residue at the same position. 
Tolerated primary anchor residues are those associated with an average binding capacity 
between 10- and 100-fold of the optimal residue at the same position. 

25 Figure 7 . Map of the A*0206 motif. Summary maps of A*0206 motif for 9-mer 

(a) and 10-mer (b) peptides. At secondary anchor positions, residues shown as preferred 
(or deleterious) are associated with an average binding capacity at least 3-fold greater 
than (or 3-fold less than) peptides of the same size carrying other residues at the same 
position. At the primary anchor positions, preferred residues are those associated with an 

30 average binding capacity within 1 0-fold of the optimal residue at the same position. 
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Tolerated primary anchor residues are those associated with an average binding capacity 
between 10- and 100-fold of the optimal residue at the same position. 

Figure 8 . Map of the A*6802 motif. Summary maps of A*6802 motif for 9-mer 
(a) and 10-mer (b) peptides. At secondary anchor positions, residues shown as preferred 
5 (or deleterious) are associated with an average binding capacity at least 3-fold greater 
than (or 3-fold less than) peptides of the same size carrying other residues at the same 
position. At the primary anchor positions, preferred residues are those associated with an 
average binding capacity within 10-fold of the optimal residue at the same position. 
Tolerated primary anchor residues are those associated with an average binding capacity 

10 between 10- and 100-fold of the optimal residue at the same position. 

Figure 9 . A2 supermotif consensus summary of secondary and primary anchor 
influences on A2-supertype binding capacity of 9-(a) and 10-mer (b) peptides. Residues 
shown significantly influence binding to 3 or more A2-supertype molecules. The number 
of molecules influenced are indicated in parentheses. At secondary anchor positions, 

15 residues are considered preferred only if they do not have a deleterious influence on more 
than one molecule. Preferred residues which were deleterious in the context of one 
molecule are indicated by reduced and italicized font. Assessment at the primary anchor 
positions are based on single substitution and peptide library analyses, as discussed in the 
text. 

20 Description of the Preferred Embodiments 

[30] The present invention relates in part to an epitope-based approach for 
vaccine design. Such an approach is based on the well-established finding that the 
mechanism for inducing CTL immune response comprises the step of presenting a CTL 
epitope as a peptide of about 8-11 amino acids bound to an HLA molecule displayed on 

25 an antigen-presenting cell. The HLA molecule is the product of a class I MHC wherein 
the product is expressed on most nucleated cells. 

[31] The products of the MHC class I alleles are generically characterized as A, 
B and C HLA molecules. Within each of these categories, there is a multiplicity of allelic 
variants in the population; indeed, there are believed to be well over 500 class I and 

30 class II alleles. Since a cytotoxic T-cell response cannot be elicited unless the epitope is 
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presented by the class I HLA contained on the surface of the cells of the individual to be 
immunized, it is important that the epitope be one that is capable of binding the HLA 
exhibited by that individual. 

[32] The starting point, therefore, for the design of effective vaccines is to 
5 ensure that the vaccine will generate a large number of epitopes that can successfully be 
presented. It may be possible to administer the peptides representing the epitopes per se. 
Such administration is dependent on the presentation of "empty" HLA molecules 
displayed on the cells of the subject. In one approach to use of the immunogenic peptides 
per se, these peptides may be incubated with antigen-presenting cells from the subject to 

10 be treated ex vivo and the cells then returned to the subject. 

[33] Alternatively, the 8-1 1 amino acid peptide can be generated in situ by 
administering a nucleic acid containing a nucleotide sequence encoding it. Means for 
providing such nucleic acid molecules are described in WO 99/58658, the disclosure of 
which is incorporated herein by reference. Further, the immunogenic peptides can be 

15 administered as portions of a larger peptide molecule and cleaved to release the desired 
peptide. The larger peptide may contain extraneous amino acids, in general the fewer the 
better. Thus, peptides which contain such amino acids are typically 25 amino acids or 
less, more typically 20 amino acids or less, and more typically 15 amino acids or less. 
The precursor may also be a heteropolymer or homopolymer containing a multiplicity of 

20 different or same CTL epitopes. Of course, mixtures of peptides and nucleic acids which 
generate a variety of immunogenic peptides can also be employed. The design of the 
peptide vaccines, the nucleic acid molecules, or the hetero- or homo-polymers is 
dependent on the inclusion of the desired epitope. The present invention provides a 
paradigm for identifying the relevant epitope which is effective across the broad 

25 population range of individuals who are characterized by the A2 supertype. The 

following pages describe the methods and results of experiments for identification of the 
A2 supermotif. 

[34] It is preferred that peptides include an epitope that binds to an HLA-A2 
supertype allele. These motifs may be used to define T-cell epitopes from any desired 
30 antigen, particularly those associated with human viral diseases, cancers or autoimmune 
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diseases, for which the amino acid sequence of the potential antigen or autoantigen 
targets is known. 

[35] Epitopes on a number of potential target proteins can be identified based 
upon HLA binding motifs. Examples of suitable antigens include prostate specific 
5 antigen (PSA), hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, 
Epstein-Barr virus antigens, melanoma antigens (e.g., MAGE-1), human 
immunodeficiency virus (HIV) antigens, human papilloma virus (HPV) antigens, p53, 
CEA, trypanosome surface antigen (TSA), and Her2/neu. 

[36] Peptides comprising the epitopes from these antigens may be synthesized 

10 and then tested for their ability to bind to the appropriate MHC molecules in assays using, 
for example, purified class I molecules and radioiodonated peptides and/or cells 
expressing empty class I molecules by, for instance, immunofluorescent staining and 
flow microfluorometry, peptide-dependent class I assembly assays, and inhibition of CTL 
recognition by peptide competition. Those peptides that bind to the class I molecule may 

15 be further evaluated for their ability to serve as targets for CTLs derived from infected or 
immunized individuals, as well as for their capacity to induce primary in vitro or in vivo 
CTL responses that can give rise to CTL populations capable of reacting with virally 
infected target cells or tumor cells as potential therapeutic agents. 

[37] The MHC class I antigens are encoded by the HLA-A, B, and C loci. 

20 HLA-A and B antigens are expressed at the cell surface at approximately equal densities, 
whereas the expression of HLA-C is significantly lower (perhaps as much as 10-fold 
lower). Each of these loci have a number of alleles. The peptide binding motifs of the 
invention are relatively specific for each allelic subtype. 

[38] For peptide-based vaccines, peptides preferably comprise a motif 

25 recognized by an MHC I molecule having a wide distribution in the human population, or 
comprise a motif recognized by a genetically diverse population. Since the MHC alleles 
occur at different frequencies within different ethnic groups and races, the choice of 
target MHC allele may depend upon the target population. Table 1 shows the frequency 
of various alleles at the HLA-A locus products among different races. For instance, the 

30 majority of the Caucasoid population can be covered by peptides which bind to four 
HLA-A allele subtypes, specifically HLA-A2.1, Al, A3.2, and A24.1. Similarly, the 



majority of the Asian population is encompassed with the addition of peptides binding to 
a fifth allele HLA-A1 1.2. 









TABLE 1 




5 


Al 


10.1(7) 


1.8(1) 


27.4(138) 




A2.1 


11.5(8) 


37.0(20) 


39.8(199) 




A2.2 


10.1(7) 


0 


3.3(17) 




A2.3 


1.4(1) 


5.5(3) 


0.8(4) 




A2.4 


- 


- 


- 


10 


A2.5 


- 


- 


_ 




A3.1 


1.4(1) 


0 


0.2(0) 




A3.2 


5.7(4) 


5.5(3) 


21.5(108) 




All.l 


0 


5.5(3) 


0 




A11.2 


5.7(4) 


31.4(17) 


8.7(44) 


15 


A11.3 


0 


3.7(2) 


0 




A23 


4.3(3) 


- 


3.9(20) 




A24 


2.9(2) 


27.7(15) 


15.3(77) 




A24.2 


- 


- 


- 




A24.3 


- 


- 


- 


20 


A25 


1.4(1) 


- 


6.9(35) 




A26.1 


4.3(3) 


9.2(5) 


5.9(30) 




A26.2 


7.2(5) 


- 


1.0(5) 




A26V 


- 


3.7(2) 






A28.1 


10.1(7) 


- 


1.6(8) 


25 


A28.2 


1.4(1) 


- 


7.5(38) 




A29.1 


1.4(1) 


- 


1.4(7) 




A29.2 


10.1(7) 


1.8(1) 


5.3(27) 




A30.1 


8.6(6) 




4.9(25) 




A30.2 


1.4(1) 


- 


0.2(1) 


30 


A30.3 


7.2(5) 




3.9(20) 




A31 


4.3(3) 


7.4(4) 


6.9(35) 




A32 


2.8(2) 




7.1(36) 




Aw33.1 


8.6(6) 




2.5(13) 




Aw33.2 


2.8(2) 


16.6(9) 


1.2(6) 


35 


Aw34.1 


1.4(1) 








Aw34.2 


14.5(10) 




0.8(4) 




Aw36 


5.9(4) 







Table compiled from B. DuPont, Immunobiology of HLA. Vol. I, Histocompatibility 
40 Testing 1987, Springer-Verlag, New York 1989. 

* N = Negroid; A = Asian; C = Caucasoid. Numbers in 
parenthesis represent the number of individuals included in the analysis. 
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[39] Cross-reactive binding of HLA-A2. 1 motif-bearing peptides with other 
HLA-A2 allele-specific molecules can occur. Those allele-specific molecules that share 
binding specificities with HLA-A2. 1 are deemed to comprise the HLA-A2.1 supertype. 
The B pocket of A2 supertype HLA molecules is characterized by a consensus motif 
5 including residues (this nomenclature uses single letter amino acid codes, where the 
subscript indicates peptide position) F/Y 9 , A 24 , M45, E/N 6 3, K/N 6 6, V 67 , H/Q 70 and Y/C99. 
Similarly, the A2-supertype F pocket is characterized by a consensus motif including 
residues D 77 , T 8 o, L 8 i and Yn 6 (155). About 66% of the peptides binding A*0201 will be 
cross-reactive amongst three or more A2-supertype alleles. 

1 0 [40] The A2 supertype as defined herein is consistent with cross-reactivity 

data, (Fruci, D. et al 9 Hum. Immunol 38:187, 1993), from live cell binding assays (del 
Guercio, M.-F. et al 9 J. Immunol 154:685, 1995) and data obtained by sequencing 
naturally processed peptides (Sudo, T., et aL 9 J. Immunol 155:4749, 1995) bound to 
HLA-A2 allele-specific molecules. Accordingly the family of HLA molecules (i.e., the 

15 HLA-A2 supertype that binds these peptides) is comprised of at least nine HLA- A 

proteins: A*0201, A*0202, A*0203, A*0204, A*0205, A*0206, A*0207, A*6802, and 
A*6901. 

[41] As described herein, the HLA-A2 supermotif comprises peptide ligands 
with L, I, V, M, A, T, or Q as primary anchor residues at position 2 and L, I, V, M, A, or 
20 T as a primary anchor residue at the C-terminal position of the epitope. HLA-A2 motifs 
that are most particularly relevant to the invention claimed here comprise V, A, T, or Q at 
position two and L, I, V, M, A, or T at the C-terminal anchor position. A peptide epitope 
comprising an HLA-A2 supermotif may bind more than one HLA-A2 supertype 
molecule. 

25 [42] A procedure which may be used to identify peptides of the present 

invention is disclosed in Falk, et al 9 Nature 351:290 (1991), incorporated herein by 
reference. Briefly, the methods involve large-scale isolation of MHC class I molecules, 
typically by immunoprecipitation or affinity chromatography, from appropriate cell or 
cell line. Examples of other methods for isolation of the desired MHC molecule equally 

30 well known to the artisan include ion exchange chromatography, lectin chromatography, 
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size exclusion, high performance ligand chromatography, and a combination of all of the 
above techniques. 

[43] In a typical case, immunoprecipitation may be used to isolate the desired 
allele. A number of protocols can be used, depending upon the specificity of the 
5 antibodies used. For example, allele-specific mAb reagents can be used for the affinity 
purification of the HLA-A, HLA-B1, and HLA-C molecules. Several mAb reagents for 
the isolation of HLA-A molecules are available. The monoclonal BB7.2 is suitable for 
isolating HLA-A2 molecules. Affinity columns prepared with these mAbs using standard 
techniques are successfully used to purify the respective HLA-A allele products. 
1 0 [44] In addition to allele-specific mAbs, broadly reactive anti-HLA-A, B, C 

mAbs, such as W6/32 and B9.12.1, and one anti-HLA-B, C mAb, Bl.23.2, could be used 
in alternative affinity purification protocols as described in the example section below. 

[45] The peptides bound to the peptide binding groove of the isolated MHC 
molecules are eluted typically using acid treatment. Peptides can also be dissociated 
1 5 from class I molecules by a variety of standard denaturing means, such as heat, pH, 
detergents, salts, chaotropic agents, or a combination thereof. 

[46] Peptide fractions are further separated from the MHC molecules by 
reversed-phase high performance liquid chromatography (HPLC) and sequenced. 
Peptides can be separated by a variety of other standard means well known to the artisan, 
20 including filtration, ultrafiltration, electrophoresis, size chromatography, precipitation 
with specific antibodies, ion exchange chromatography, isoelectrofocusing, and the like. 

[47] Sequencing of the isolated peptides can be performed according to 
standard techniques such as Edman degradation (Hunkapiller, M.W., et aL, Methods 
EnzymoL 91, 399 [1983]). Other methods suitable for sequencing include mass 
25 spectrometry sequencing of individual peptides as previously described (Hunt, et al , 
Science 225:1261 (1992), which is incorporated herein by reference). Amino acid 
sequencing of bulk heterogeneous peptides (e.g., pooled HPLC fractions) from different 
class I molecules typically reveals a characteristic sequence motif for each class I allele. 
[48] Definition of motifs specific for different class I alleles allows the 
30 identification of potential peptide epitopes from an antigenic protein whose amino acid 
sequence is known. Typically, identification of potential peptide epitopes is initially 

15 



carried out using a computer to scan the amino acid sequence of a desired antigen for the 
presence of motifs. 

[49] Following identification of motif-bearing epitopes, the epitopic sequences 
are then synthesized. The capacity to bind MHC Class molecules is measured in a 
5 variety of different ways. One means is a Class I molecule binding assay as described in 
the related applications, noted below. Other alternatives described in the literature 
include inhibition of antigen presentation (Sette, et al 9 1 Immunol 141 :3893 (1991), in 
vitro assembly assays (Townsend, et al., Cell 62:285 (1990), and FACS based assays 
using mutated cells, such as RMA.S (Melief, et al, Eur. J. Immunol 21:2963 (1991)). 

1 0 [50] As disclosed herein, higher HLA binding affinity is correlated with greater 

immunogenicity. Greater immunogenicity can be manifested in several different ways. 
Immunogenicity can correspond to whether an immune response is elicited at all, and to 
the vigor of any particular response, as well as to the extent of a diverse population in 
which a response is elicited. For example, a peptide might elicit an immune response in a 

1 5 diverse array of the population, yet in no instance produce a vigorous response. In 

accordance with the principles disclosed herein, close to 90% of high binding peptides 
have been found to be immunogenic, as contrasted with about 50% of the peptides which 
bind with intermediate affinity. Moreover, higher binding affinity peptides lead to more 
vigorous immunogenic responses. As a result, less peptide is required to elicit a similar 

20 biological effect if a high affinity binding peptide is used. Thus, in preferred 

embodiments of the invention, high affinity binding epitopes are particularly useful. 
Nevertheless, substantial improvements over the prior art are achieved with intermediate 
or high binding peptides. 

[51] The relationship between binding affinity for HLA class I molecules and 

25 immunogenicity of discrete peptide epitopes has been determined for the first time in the 
art by the present inventors. In these experiments, in which discrete peptides were 
referred to, it is to be noted that cellular processing of peptides in vivo will lead to such 
peptides even if longer fragments are used. Accordingly, longer peptides comprising one 
or more epitopes are within the scope of the invention. The correlation between binding 

30 affinity and immunogenicity was analyzed in two different experimental approaches 
(Sette, et al, 1 Immunol 153:5586-5592, 1994). In the first approach, the 



immunogenicity of potential epitopes ranging in HLA binding affinity over a 10,000-fold 
range was analyzed in HLA-A*0201 transgenic mice. In the second approach, the 
antigenicity of approximately 100 different hepatitis B virus (HBV)-derived potential 
epitopes, all carrying A*0201 binding motifs, was assessed by using PBL (peripheral 
5 blood lymphocytes) from acute hepatitis patients. Pursuant to these approaches, it was 
determined that an affinity threshold value of approximately 500 nM (preferably 50 nM 
or less) is correlated with the capacity of a peptide epitope to elicit a CTL response. 
These data are true for class I binding affinity measurements for naturally processed 
peptides and for synthesized T-cell epitopes. These data also indicate the important role 

10 of determinant selection in the shaping of T-cell responses (see, e.g., Schaeffer, et aL, 
Proa Natl. Acad. Sci. USA 86:4649-4653, 1989). 

[52] Accordingly, CTL-inducing peptides preferably include those that have an 
IC 50 for class I HLA molecules of 500 nM or less. In the case of motif-bearing peptide 
epitopes from tumor associated antigens, a binding affinity threshold of 200 nM has been 

15 shown to be associated with killing of tumor cells by resulting CTL populations. 

[53] In a preferred embodiment, following assessment of binding activity for 
an HLA-A2 allele-specific molecule, peptides exhibiting high or intermediate affinity are 
then considered for further analysis. Selected peptides may be tested on other members 
of the supertype family. In preferred embodiments, peptides that exhibit cross-reactive 

20 binding are then used in vaccines or in cellular screening analyses. 

[54] For example, peptides that test positive in the HLA-A2 binding assay, z.e., 
that have binding affinity values of 500 nM or less, are assayed for the ability of the 
peptides to induce specific CTL responses in vitro. For instance, antigen-presenting cells 
that have been incubated with a peptide can be assayed for the ability to induce CTL 

25 responses in responder cell populations. Antigen-presenting cells can be normal cells 
such as peripheral blood mononuclear cells or dendritic cells (Inaba, et ah, J. Exp. Med. 
166:182 (1987); Boog, Eur. J. Immunol. 18:219 [1988]). 

[55] Alternatively, mutant mammalian cell lines that are deficient in their 
ability to load class I molecules with internally processed peptides, such as the mouse cell 

30 lines RMA-S (Karre, et al. y Nature, 319:675 (1986); Ljunggren, et al, Eur. J. Immunol 
21 :2963-2970 (1991)), and the human somatic T-cell hybrid, T-2 (Cerundolo, et a/., 



Nature 345:449-452 (1990)) and which have been transfected with the appropriate genes 
which encode human class I molecules are conveniently used, when peptide is 
exogenously added to them, to test for the capacity of the peptide to induce in vitro 
primary CTL responses. Other eukaryotic cell lines that can be used include various 
5 insect cell lines such as mosquito larvae (ATCC cell lines CCL 125, 126, 1660, 1591, 
6585, 6586), silkworm (ATTC CRL 8851), armywoim (ATCC CRL 171 1), moth (ATCC 
CCL 80) and Drosophila cell lines such as a Schneider cell line (see Schneider J. 
Embryol. Exp. Morphol 27:353-365 [1927]). 

[56] Peripheral blood lymphocytes are conveniently isolated following simple 

10 venipuncture or leukapheresis of normal donors or patients and used as the responder cell 
sources of CTL precursors. In one embodiment, the appropriate antigen-presenting cells 
are incubated with 10-100 fiM of peptide in serum-free media for 4 hours under 
appropriate culture conditions. The peptide-loaded antigen-presenting cells are then 
incubated with the responder cell populations in vitro for 7 to 10 days under optimized 

1 5 culture conditions. Positive CTL activation can be determined by assaying the cultures 
for the presence of CTLs that kill radiolabeled target cells, both specific peptide-pulsed 
targets as well as target cells expressing endogenously processed form of the relevant 
virus or tumor antigen from which the peptide sequence was derived. 

[57] Specificity and MHC restriction of the CTL is determined by testing 

20 against different peptide target cells expressing appropriate or inappropriate human MHC 
class I. The peptides that test positive in the MHC binding assays and give rise to 
specific CTL responses are referred to herein as immunogenic peptides. 

[58] Kast, et al (J. Immunol 152:3904-3912, 1994) have shown that motif- 
bearing peptides account for 90% of the epitopes that bind to allele-specific HLA class I 

25 molecules. In this study all possible peptides of 9 amino acids in length and overlapping 
by eight amino acids (240 peptides), which cover the entire sequence of the E6 and E7 
proteins of human papillomavirus type 16, were evaluated for binding to five allele- 
specific HLA molecules that are expressed at high frequency among different ethnic 
groups. This unbiased set of peptides allowed an evaluation of the predictive value of 

30 HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that bound 
to an allele-specific HLA molecules with high or intermediate affinity. Of these 22 



peptides, 20, (i.e. 91%), were motif-bearing. Thus, this study demonstrated the value of 
motifs for the identification of peptide epitopes for inclusion in a vaccine: application of 
motif-based identification techniques eliminates screening of 90% of the potential 
epitopes. The quantity of available peptides, and the complexity of the screening process 
5 would make a comprehensive evaluation of an antigen highly difficult, if not impossible 
without use of motifs. 

[59] An immunogenic peptide epitope of the invention may be included in a 
polyepitopic vaccine composition comprising additional peptide epitopes of the same 
antigen, antigens from the same source, and/or antigens from a different source. 

10 Moreover, class II epitopes can be included along with class I epitopes. Peptide epitopes 
from the same antigen may be adjacent epitopes that are contiguous in sequence or may 
be obtained from different regions of the protein. 

[60] As noted in greater detail below, the immunogenic peptides can be 
prepared synthetically, such as by chemical synthesis or by recombinant DNA 

15 technology, or isolated from natural sources such as whole viruses or tumors. Although 
the peptide will preferably be substantially free of other naturally occurring host cell 
proteins and fragments thereof, in some embodiments the peptides can be synthetically 
conjugated to native fragments or particles. 

[61] The polypeptides or peptides can be a variety of lengths, either in their 

20 neutral (uncharged) forms or in forms which are salts, and either free of modifications 
such as glycosylation, side chain oxidation, or phosphorylation or containing these 
modifications, subject to the condition that the modification not destroy the biological 
activity of the polypeptides as herein described. 

[62] Desirably, the peptide will be as small as possible while still maintaining 

25 substantially all of the biological activity of the large peptide. When possible, it may be 
desirable to optimize peptide epitopes of the invention to a length of 9 or 10 amino acid 
residues, commensurate in size with endogenously processed viral peptides or tumor cell 
peptides that are bound to MHC class I molecules on the cell surface. 

[63] Peptides having the desired activity may be modified as necessary to 

30 provide certain desired attributes, e.g., improved pharmacological characteristics, while 
increasing or at least retaining substantially all of the biological activity of the 



unmodified peptide to bind the desired MHC molecule and activate the appropriate T- 
cell For instance, the peptides may be subject to various changes, such as substitutions, 
either conservative or non-conservative, where such changes might provide for certain 
advantages in their use, such as improved MHC binding. By conservative substitutions is 
5 meant replacing an amino acid residue with another which is biologically and/or 

chemically similar, e.g., one hydrophobic residue for another, or one polar residue for 
another. The substitutions include combinations such as Gly, Ala; Val, He, Leu, Met; 
Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. The effect of single amino acid 
substitutions may also be probed using D-amino acids. Such modifications may be made 

10 using well known peptide synthesis procedures, as described in e.g., Merrifield, Science 
232:341-347 (1986), Barany and Merrifield, The Peptides, Gross and Meienhofer, eds. 
(N.Y., Academic Press), pp. 1-284 (1979); and Stewart and Young, Solid Phase Peptide 
Synthesis, (Rockford, 111., Pierce), 2d Ed. (1984). 

[64] The peptides can also be modified by extending or decreasing the 

15 compound's amino acid sequence, e.g., by the addition or deletion of amino acids. The 
peptides or analogs of the invention can also be modified by altering the order or 
composition of certain residues, it being readily appreciated that certain amino acid 
residues essential for biological activity, e.g., those at critical contact sites or conserved 
residues, may generally not be altered without an adverse effect on biological activity. 

20 The non-critical amino acids need not be limited to those naturally occurring in proteins, 
such as L-a-amino acids, but may include non-natural amino acids as well, such as /?-y-5- 
amino acids, as well as many derivatives of L-a-amino acids such as D-isomers of natural 
amino acids. 

[65] Typically, a series of peptides with single amino acid substitutions are 
25 employed to determine the effect of electrostatic charge, hydrophobicity, etc. on binding. 
For instance, a series of positively charged (e.g., Lys or Arg) or negatively charged (e.g., 
Glu) amino acid substitutions are made along the length of the peptide revealing different 
patterns of sensitivity towards various MHC molecules and T-cell receptors. In addition, 
multiple substitutions using small, relatively neutral moieties such as Ala, Gly, Pro, or 
30 similar residues may be employed. The substitutions may produce multi-epitopic 
peptides which are homo-oligomers or hetero-oligomers. The number and types of 

20 



residues which are substituted or added depend on the spacing necessary between 
essential contact points and certain functional attributes which are sought (e.g., 
hydrophobicity versus hydrophilicity). Increased binding affinity for an MHC molecule 
or T-cell receptor may also be achieved by such substitutions, compared to the affinity of 
5 the parent peptide. In any event, such substitutions generally employ amino acid residues 
or other molecular fragments chosen to avoid, for example, steric and charge interference 
which might disrupt binding. 

[66] Amino acid substitutions are typically of single residues. Substitutions, 
deletions, insertions or any combination thereof may be combined to arrive at a final 
10 peptide. Substitutional variants are those in which at least one residue of a peptide has 
been removed and a different residue inserted in its place. Such substitutions generally 
are made in accordance with the following Table 2 when it is desired to finely modulate 
the characteristics of the peptide. 

TABLE 2 



15 


Original Residue 


Exemplary Substitution 




Ala 


Ser 




Arg 


Lys, His 




Asn 


Gin 




Asp 


Glu 


20 


Cys 


Ser 




Gin 


Asn 




Glu 


Asp 




Gly 


Pro 




His 


Lys; Arg 


25 


He 


Leu; Val 




Leu 


lie; Val 




Lys 


Arg; His 




Met 


Leu; He 




Phe 


Tyr; Trp 


30 


Ser 


Thr 




Thr 


Ser 




Trp 


Tyr; Phe 




Tyr 


Trp; Phe 




Val 


He; Leu 



35 



[67] Substantial changes in function (e.g., affinity for MHC molecules or T-cell 
receptors) are made by selecting substitutions that are less conservative than those in 
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Table 2, i.e., selecting residues that differ more significantly in their effect on 
maintaining (a) the structure of the peptide backbone in the area of the substitution, for 
example as a sheet or helical conformation, (b) the charge or hydrophobicity of the 
molecule at the target site or (c) the bulk of the side chain. The substitutions which in 
5 general are expected to produce the greatest changes in peptide properties will be those in 
which (a) hydrophilic residue, e.g. seryl, is substituted for (or by) a hydrophobic residue, 
e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a residue having an 
electropositive side chain, e.g., lysl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g. glutamyl or aspartyl; or (c) a residue having a bulky side 
10 chain, e.g. phenylalanine, is substituted for (or by) one not having a side chain, e.g., 
glycine. 

[68] The peptides may also comprise isosteres of two or more residues in the 
immunogenic peptide. An isostere as defined here is a sequence of two or more residues 
that can be substituted for a second sequence because the steric conformation of the first 

1 5 sequence fits a binding site specific for the second sequence. The term specifically 
includes peptide backbone modifications well known to those skilled in the art. Such 
modifications include modifications of the amide nitrogen, the ce-carbon, amide carbonyl, 
complete replacement of the amide bond, extensions, deletions or backbone crosslinks. 
See, generally, Spatola, Chemistry and Biochemistry of Amino Acids, Peptides and 

20 Proteins, Vol VII (Weinstein ed. ? 1983). 

[69] Modifications of peptides with various amino acid mimetics or unnatural 
amino acids are particularly useful in increasing the stability of the peptide in vivo. 
Stability can be assayed in a number of ways. For instance, peptidases and various 
biological media, such as human plasma and serum, have been used to test stability. See, 

25 e.g., Verhoef, et al 9 Eur. J. DrugMetab. Pharmacokin. 1 1 :291-302 (1986). Half life of 
the peptides of the present invention is conveniently determined using a 25% human 
serum (v/v) assay. The protocol is generally as follows. Pooled human serum (Type AB, 
non-heat inactivated) is delipidated by centrifugation before use. The serum is then 
diluted to 25% with RPMI tissue culture media and used to test peptide stability. At 

30 predetermined time intervals a small amount of reaction solution is removed and added to 
either 6% aqueous trichloracetic acid or ethanol. The cloudy reaction sample is cooled 
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(4°C) for 15 minutes and then spun to pellet the precipitated serum proteins. The 
presence of the peptides is then determined by reversed-phase HPLC using stability- 
specific chromatography conditions. 

[70] The peptides of the present invention or analogs thereof which have CTL 
5 stimulating activity may be modified to provide desired attributes other than improved 
serum half life. For instance, the ability of the peptides to induce CTL activity can be 
enhanced by linkage to a sequence which contains at least one epitope that is capable of 
inducing a T helper cell response. 

[71] In some embodiments, the T helper peptide is one that is recognized by T 

10 helper cells in the majority of the population. This can be accomplished by selecting 
amino acid sequences that bind to many, most, or all of the MHC class II molecules. 
These are known as "loosely MHC-restricted" T helper sequences. Examples of amino 
acid sequences that are loosely MHC-restricted include sequences from antigens such as 
Tetanus toxin at positions 830-843 (QYIKANSKFIGITE), Plasmodium falciparum 

1 5 circumsporozoite (CS) protein at positions 378-398 (DIEKKIAKMEKASS VFNWNS), 
and Streptococcus 18kD protein at positions 1-16 (YGAVDSILGGVATYGAA). 

[72] Alternatively, it is possible to prepare synthetic peptides capable of 
stimulating T helper lymphocytes, in a loosely MHC-restricted fashion, using amino acid 
sequences not found in nature. These synthetic compounds, called Pan-DR-binding 

20 epitopes or PADRE™ molecules (Epimmune, San Diego, CA), are designed on the basis 
of their binding activity to most HLA-DR (human MHC class II) molecules {see, e.g., 
U.S. Patent Number 5,736,142). 

[73] Particularly preferred immunogenic peptides/T helper conjugates are 
linked by a spacer molecule. The spacer is typically comprised of relatively small, 

25 neutral molecules, such as amino acids or amino acid mimetics, which are substantially 
uncharged under physiological conditions. The spacers are typically selected from, e.g., 
Ala, Gly, or other neutral spacers of nonpolar amino acids or neutral polar amino acids. 
It will be understood that the optionally present spacer need not be comprised of the same 
residues and thus may be a hetero- or homo-oligomer. When present, the spacer will 

30 usually be at least one or two residues, more usually three to six residues. Alternatively, 
the CTL peptide may be linked to the T helper peptide without a spacer. 
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[74] The immunogenic peptide may be linked to the T helper peptide, either 
directly or via a spacer, at the amino or carboxy terminus of the CTL peptide. The amino 
terminus of either the immunogenic peptide or the T helper peptide may be acylated. 
Exemplary T helper peptides include tetanus toxoid 830-843, influenza 307-319, malaria 
5 circumsporozoite 382-398 and 378-389. 

[75] In some embodiments it may be desirable to include in the pharmaceutical 
compositions of the invention at least one component which primes CTL. Lipids have 
been identified as agents capable of priming CTL in vivo against viral antigens. For 
example, palmitic acid residues can be attached to the alpha and epsilon amino groups of 

10 a Lys residue and then linked, e.g., via one or more linking residues such as Gly, Gly- 

Gly-, Ser, Ser-Ser, or the like, to an immunogenic peptide. The lipidated peptide can then 
be injected directly in a micellar form, incorporated into a liposome or emulsified in an 
adjuvant, e.g., incomplete Freund ! s adjuvant. In a preferred embodiment a particularly 
effective immunogen comprises palmitic acid attached to alpha and epsilon amino groups 

15 of Lys, which is attached via linkage, e.g., Ser-Ser, to the amino terminus of the 
immunogenic peptide. 

[76] As another example of lipid priming of CTL responses, E. coli 
lipoproteins, such as tripalmitoyl-S-glycerylcysteinlyseryl-serine (P3CSS) can be used to 
prime virus specific CTL when covalently attached to an appropriate peptide. See, Deres, 

20 et al, Nature 342:561-564 (1989). Peptides of the invention can be coupled to P 3 CSS, 
for example, and the lipopeptide administered to an individual to specifically prime a 
CTL response to the target antigen. Further, as the induction of neutralizing antibodies 
can also be primed with P3CSS conjugated to a peptide which displays an appropriate 
epitope, the two compositions can be combined to more effectively elicit both humoral 

25 and cell-mediated responses to infection. 

[77] In addition, additional amino acids can be added to the termini of a peptide 
to provide for ease of linking peptides one to another, for coupling to a carrier support, or 
larger peptide, for modifying the physical or chemical properties of the peptide or 
oligopeptide, or the like. Amino acids such as tyrosine, cysteine, lysine, glutamic or 

30 aspartic acid, or the like, can be introduced at the C- or N-terminus of the peptide or 
oligopeptide. Modification at the C terminus in some cases may alter binding 
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characteristics of the peptide. In addition, the peptide or oligopeptide sequences can 
differ from the natural sequence by being modified by terminal-NH2 acylation, e.g., by 
alkanoyl (C1-C20) or thioglycolyl acetylation, terminal-carboxyl amidation, e.g., 
ammonia, methylamine, etc. In some instances these modifications may provide sites for 
5 linking to a support or other molecule. 

[78] The peptides of the invention can be prepared in a wide variety of ways. 
Because of their relatively short size, the peptides (discrete epitopes or polyepitopic 
peptides) can be synthesized in solution or on a solid support in accordance with 
conventional techniques. Various automatic synthesizers are commercially available and 

10 can be used in accordance with known protocols. See, for example, Stewart and Young, 
Solid Phase Peptide Synthesis, 2d. ed„ Pierce Chemical Co. (1 984), supra. 

[79] Alternatively, preparation of peptides of the invention can comprise use of 
recombinant DNA technology wherein a nucleotide sequence which encodes an 
immunogenic peptide of interest is inserted into an expression vector, transformed or 

1 5 transfected into an appropriate host cell and cultivated under conditions suitable for 
expression. These procedures are generally known in the art, as described generally in 
Sambrook, et ah, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, 
Cold Spring Harbor, New York (1 982), which is incorporated herein by reference. Thus, 
fusion proteins which comprise one or more peptide sequences of the invention can be 

20 used to present the appropriate T-cell epitope. 

[80] As the coding sequence for peptides of the length contemplated herein can 
be synthesized by chemical techniques, for example, the phosphotriester method of 
Matteucci, et al, J. Am. Chem. Soc. 103:3185 (1981), modification can be made simply 
by substituting the appropriate base(s) for those encoding the native peptide sequence. 

25 The coding sequence can then be provided with appropriate linkers and ligated into 
expression vectors commonly available in the art, and the vectors used to transform 
suitable hosts to produce the desired fusion protein. A number of such vectors and 
suitable host systems are now available. For expression of the fusion proteins, the coding 
sequence will be provided with operably linked start and stop codons, promoter and 

30 terminator regions and usually a replication system to provide an expression vector for 
expression in the desired cellular host. For example, promoter sequences compatible 



with bacterial hosts are provided in plasmids containing convenient restriction sites for 
insertion of the desired coding sequence. The resulting expression vectors are 
transformed into suitable bacterial hosts. Of course, yeast or mammalian cell hosts may 
also be used, employing suitable vectors and control sequences. 
5 [81] The peptides of the present invention and pharmaceutical and vaccine 

compositions thereof are useful for administration to mammals, particularly humans, to 
therapeutically treat and/or prevent infections and cancer. Examples of diseases which 
can be treated using the immunogenic peptides of the invention include prostate cancer, 
hepatitis B, hepatitis C, AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV 

10 infection and condlyloma acuminatum. 

[82] For pharmaceutical compositions, the immunogenic peptides of the 
invention are often administered to an individual already suffering from cancer or 
infected with the virus of interest. Those in the incubation phase or the acute phase of 
infection can be treated with the immunogenic peptides separately or in conjunction with 

15 other treatments, as appropriate. In therapeutic applications, compositions are 

administered to a patient in an amount sufficient to elicit an effective CTL response to the 
infectious disease agent or tumor antigen and to cure or at least partially arrest symptoms 
and/or complications. An amount adequate to accomplish this is defined as a 
"therapeutically effective dose" or "unit dose". Amounts effective for this use will 

20 depend on, e.g., the peptide composition, the manner of administration, the stage and 

severity of the disease being treated, the weight and general state of health of the patient, 
and the judgment of the prescribing physician. Generally for humans the dose range for 
the initial immunization (that is for therapeutic or prophylactic administration) is from 
about 1.0 fig to about 20,000 fig of peptide for a 70 kg patient, preferably, 100 fig 150 

25 fig 200 ^g 250 fig -, 300 fig 400 fig or 500 fig -20,000 fig, followed by boosting 
dosages in the same dose range pursuant to a boosting regimen over weeks to months 
depending upon the patient's response and condition by measuring specific CTL activity 
in the patient's blood. In embodiments where recombinant nucleic acid administration is 
used, the administered material is titrated to achieve the appropriate therapeutic response. 

30 It must be kept in mind that the peptides and compositions of the present invention may 
generally be employed in serious disease states, that is, life-threatening or potentially life 
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threatening situations. In such cases, in view of the minimization of extraneous 
substances in the compositions of the invention and , e.g., the relative nontoxic nature of 
the peptides, it is possible and may be felt desirable by the treating physician to 
administer substantial excesses of these compositions. 
5 [83] For therapeutic use, administration should begin at the first sign of 

infection or the detection or surgical removal of tumors or shortly after diagnosis in the 
case of acute infection. This is followed by boosting doses until at least symptoms are 
substantially abated and for a period thereafter. In chronic infection, loading doses 
followed by boosting doses may be required. 

1 0 [84] Treatment of an infected individual with the compositions of the invention 

may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 
infection. Where the susceptible individuals are identified prior to or during infection, 

15 for instance, as described herein, the composition can be targeted to them, minimizing 
need for administration to a larger population. 

[85] The peptide compositions can also be used for the treatment of chronic 
infection and to stimulate the immune system to eliminate, e.g., virus-infected cells in 
carriers. It is important to provide an amount of immuno-potentiating peptide in a 

20 formulation and mode of administration sufficient to effectively stimulate a cytotoxic T- 
cell response. Thus, for treatment of chronic infection, immunizing doses followed by 
boosting doses at established intervals, e.g., from one to four weeks, may be required, 
possibly for a prolonged period of time to effectively immunize an individual. In the case 
of chronic infection, administration should continue until at least clinical symptoms or 

25 laboratory tests indicate that the infection has been eliminated or substantially abated and 
for a period thereafter. 

[86] The pharmaceutical compositions for therapeutic treatment are intended 
for parenteral, topical, oral or local administration. Peptides of the invention can be 
administered in a form of nucleic acids that encode the peptides. Preferably, the 

30 pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermally, or intramuscularly. Thus, the invention provides 
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compositions for parenteral administration which comprise a solution of the 
immunogenic peptides dissolved or suspended in an acceptable carrier, preferably an 
aqueous carrier. A variety of aqueous carriers may be used, e.g., water, buffered water, 
0.8% saline, 0.3% glycine, hyaluronic acid and the like. These compositions may be 
5 sterilized by conventional, well known sterilization techniques, or may be sterile filtered. 
The resulting aqueous solutions may be packaged for use as is, or lyophilized, the 
lyophilized preparation being combined with a sterile solution prior to administration. 
The compositions may contain pharmaceutically acceptable auxiliary substances as 
required to approximate physiological conditions, such as pH adjusting and buffering 

10 agents, tonicity adjusting agents, wetting agents and the like, for example, sodium 

acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, sorbitan 
monolaurate, triethanolamine oleate, etc. 

[87] The concentration of CTL stimulatory peptides of the invention in the 
pharmaceutical formulations can vary widely, i.e., from less than about 0.1%, usually at 

15 or at least about 2% to as much as 20% to 50% or more by weight, and will be selected 
primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of 
administration selected. A human unit dose form of the peptide composition is typically 
included in a pharmaceutical composition that comprises a human unit dose of an 
acceptable carrier, preferably an aqueous carrier, and is administered in a volume of fluid 

20 that is known by those of skill in the art to be used for administration of such 
compositions to humans. 

[88] The peptides of the invention may also be administered via liposomes, 
which serve to target the peptides to a particular tissue, such as lymphoid tissue, or 
targeted selectively to infected cells, as well as increase* the half-life of the peptide 

25 composition. Liposomes include emulsions, foams, micelles, insoluble monolayers, 
liquid crystals, phospholipid dispersions, lamellar layers and the like. In these 
preparations the peptide to be delivered is incorporated as part of a liposome, alone or in 
conjunction with a molecule which binds to, e.g., a receptor prevalent among lymphoid 
cells, such as monoclonal antibodies which bind to the CD45 antigen, or with other 

30 therapeutic or immunogenic compositions. Thus, liposomes either filled or decorated 
with a desired peptide of the invention can be directed to the site of lymphoid cells, 
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where the liposomes then deliver the selected therapeutic/immunogenic peptide 
compositions. Liposomes for use in the invention are formed from standard vesicle- 
forming lipids, which generally include neutral and negatively charged phospholipids and 
a sterol, such as cholesterol The selection of lipids is generally guided by consideration 
5 of, e.g., liposome size, acid lability and stability of the liposomes in the blood stream. A 
variety of methods are available for preparing liposomes, as described in, e.g., Szoka, et 
aL,Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 
4,837,028, and 5,019,369. 

[89] For targeting to the immune cells, a ligand to be incorporated into the 

10 liposome can include, e.g., antibodies or fragments thereof specific for cell surface 

determinants of the desired immune system cells. A liposome suspension containing a 
peptide may be administered intravenously, locally, topically, etc. in a dose which varies 
according to, inter alia, the manner of administration, the peptide being delivered, and the 
stage of the disease being treated. 

1 5 [90] For solid compositions, conventional nontoxic solid carriers may be used 

which include, for example, pharmaceutical grades of mannitol, lactose, starch, 
magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutical^ acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 

20 those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 
more peptides of the invention, and more preferably at a concentration of 25%-75%. 

[91] For aerosol administration, the immunogenic peptides are preferably 
supplied in finely divided form along with a surfactant and propellant. Typical 
percentages of peptides are 0.01%-20% by weight, preferably 1%-10%. The surfactant 

25 must, of course, be nontoxic, and preferably soluble in the propellant. Representative of 
such agents are the esters or partial esters of fatty acids containing from 6 to 22 carbon 
atoms, such as caproic, octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and 
oleic acids with an aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, 
such as mixed or natural glycerides may be employed. The surfactant may constitute 

30 0.1 %-20% by weight of the composition, preferably 0.25-5%. The balance of the 
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composition is ordinarily propellant. A carrier can also be included, as desired, as with, 

e.g., lecithin for intranasal delivery. 

[92] Accordingly, an aspect the present invention is directed to vaccines which 

contain as an active ingredient an immunogenically effective amount of an immunogenic 
5 peptide as described herein. The peptides may also be administered in the form of 

nucleic acids which encode peptides of the invention upon expression in the recipient. 

The peptide(s) may be introduced into a host, including humans, linked to its own carrier 

or as a homopolymer or heteropolymer of active peptide units. Such a polymer has the 

advantage of increased immunological reaction and, where different peptides are used to 
10 make up the polymer, the additional ability to induce antibodies and/or CTLs that react 

with different antigenic determinants of the virus or tumor cells. Useful carriers are well 

known in the art, and include, e.g., thyroglobulin, albumins such as human serum 
I S albumin, tetanus toxoid, polyamino acids such as poly(lysine:glutamic acid), influenza, 

hepatitis B virus core protein, hepatitis B virus recombinant vaccine and the like. The 
fff 15 vaccines can also contain a physiologically tolerable (acceptable) diluent such as water, 
■ fj phosphate buffered saline, or saline, and further typically include an adjuvant. Materials 

W such as incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or 

i 3 alum are materials well known in the art as adjuvants. And, as mentioned above, CTL 

]yt responses can be primed by conjugating peptides of the invention to lipids, such as 

! U 20 P3CSS. Upon immunization with a peptide composition as described herein, via 
1 T injection, aerosol, oral, transdermal or other route, the immune system of the host 

responds to the vaccine by producing large amounts of CTLs specific for the desired 

antigen, and the host becomes at least partially immune to later infection, or resistant to 

developing chronic infection. 
25 [93] In some instances it may be desirable to combine the peptide vaccines of 

the invention with vaccines which induce neutralizing antibody responses to the virus of 

interest, particularly to viral envelope antigens. 

[94] For therapeutic or immunization purposes, peptides of the invention can 

be administered in the form of nucleic acids encoding one or more of the peptides of the 
30 invention. The nucleic acids can encode a peptide of the invention and optionally one or 

more additional molecules. A number of methods are conveniently used to deliver 
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nucleic acids to a patient. For instance, nucleic acid can be delivered directly, as "naked 
DNA" This approach is described, for instance, in Wolff, et al., Science 247: 1465-1468 
(1990) as well as U.S. Patent Nos. 5,580,859 and 5,589,466. Nucleic acids can also be 
administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5 5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. 

[95] The nucleic acids can also be delivered complexed to cationic compounds, 
such as cationic lipids. Lipid-mediated gene delivery methods are described, for 
instance, in WO 96/18372; WO 93/24640; Mannino & Gould-Fogerite, BioTechniques 

10 6(7): 682-691 (1988); Rose U.S. Pat No. 5,279,833; WO 91/06309; and Feigner, et al, 
Proc. Natl. Acad. Sci. USA 84: 1AIZ-1AU (1987). 

[96] The peptides of the invention can also be expressed by attenuated viral 
hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus as a 
vector to express nucleotide sequences that encode the peptides of the invention. Upon 

15 introduction into an acutely or chronically infected host or into a noninfected host, the 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits a host 
CTL response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described, e.g., in Stover, et al {Nature 351:456-460 (1991)). 

20 A wide variety of other vectors useful for therapeutic administration or immunization of 
the peptides of the invention, e.g., Salmonella typhi vectors and the like, will be apparent 
to those skilled in the art from the description herein. 

[97] A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple epitopes of the invention 

25 optionally together with other molecules. To create a DNA sequence encoding the 
selected CTL epitopes (minigene) for expression in human cells, the amino acid 
sequences of the epitopes are, e.g., reverse translated. A human codon usage table is used 
to guide the codon choice for each amino acid. These epitope-encoding DNA sequences 
are directly adjoined, creating a molecule that encodes a continuous polypeptide 

30 sequence. Optionally, to optimize expression and/or immunogenicity, additional 
elements can be incorporated into the minigene design. Examples of amino acid 
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sequence that could be reverse translated and included in the minigene sequence include: 
helper T lymphocyte epitopes, a leader (signal) sequence, and an endoplasmic reticulum 
retention signal. In addition, MHC presentation of CTL epitopes may be improved by 
including synthetic (e.g. poly-alanine) or naturally-occurring flanking sequences adjacent 
5 to the CTL epitopes. 

[98] The minigene sequence is converted to DNA by assembling 
oligonucleotides that encode the plus and minus strands of the minigene. Overlapping 
oligonucleotides (30-100 bases long) are synthesized, phosphorylated, purified and 
annealed under appropriate conditions using well known techniques, he ends of the 

10 oligonucleotides are joined using T4 DNA ligase. This synthetic minigene, encoding the 
CTL epitope polypeptide, can then cloned into a desired expression vector. 

[99] Standard regulatory sequences well known to those of skill in the art are 
generally included in the vector to ensure expression in the target cells. Several vector 
elements are required: a promoter with a down-stream cloning site for minigene insertion; 

15 a polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 
promoter sequences. 

20 [ 1 00] Additional vector modifications may be desired to optimize minigene 

expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be 
incorporated into the transcribed region of the minigene. The inclusion of mRNA 
stabilization sequences can also be considered for increasing minigene expression. It has 

25 recently been proposed that immunostimulatory sequences (ISSs or CpGs) play a role in 
the immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity, 

[101] In some embodiments, a bicistromc expression vector, to allow production 
of the minigene-encoded epitopes and a second protein included to enhance or decrease 

30 immunogenicity can be used. Examples of proteins or polypeptides that could 

beneficially enhance the immune response if co-expressed include cytokines (e.g., IL2, 
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IL12, GM-CSF), cytokine-inducing molecules (e.g. LeIF) or costimulatory molecules. 
Moreover, if helper T lymphocyte (HTL) epitopes are employed, the HTL epitopes can 
be joined to intracellular targeting signals and expressed separately from the CTL 
epitopes. This allows direction of the HTL epitopes to a cell compartment different than 
5 the CTL epitopes. This can facilitate more efficient entry of HTL epitopes into the MHC 
class II pathway, thereby facilitating and improving CTL induction. In contrast to CTL 
induction, specifically decreasing the immune response by co-expression of 
immunosuppressive molecules (e.g. TGF-/3) may be beneficial in certain diseases. 

[102] Once an expression vector is selected, the minigene is cloned into the 

10 polylinker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. 
Bacterial cells harboring the correct plasmid can be stored as a master cell bank and a 

1 5 working cell bank. 

[103] Therapeutic quantities of plasmid DNA are produced, e.g., by 
fermentation in E. coli, followed by purification. Aliquots from the working cell bank 
are used to inoculate fermentation medium (such as Terrific Broth), and grown to 
saturation in shaker flasks or a bioreactor according to well known techniques. Plasmid 

20 DNA can be purified using standard bioseparation technologies such as solid phase 
anion-exchange resins supplied by Quiagen. If required, supercoiled DNA can be 
isolated from the open circular and linear forms using gel electrophoresis or other 
methods. 

[ 1 04] Purified plasmid DNA can be prepared for injection using a variety of 
25 formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 
phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fiisogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
30 could also be complexed to purified plasmid DNA to influence variables such as stability, 
intramuscular dispersion, or trafficking to specific organs or cell types. 
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[105] Target cell sensitization can be used as a functional assay for expression 
and MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA 
can be introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
5 formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids 
allow direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) 
can be co-transfected to allow enrichment of transfected cells using fluorescence 
activated cell sorting (FACS). These cells are then chromium-51 labeled and used as 
target cells for epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates 

10 production of MHC presentation of minigene-encoded CTL epitopes. 

[106] In vivo immunogenicity is a second approach for functional testing of 
minigene DNA formulations. Transgenic mice expressing appropriate human MHC 
molecules can be immunized with the DNA product. The dose and route of 
administration are formulation dependent (e.g. JM for DNA in PBS, IP for lipid- 

15 complexed DNA). Twenty-one days after immunization, splenocytes are harvested and 
restimulated for 1 week in the presence of peptides encoding each epitope being tested. 
These effector cells (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 
labeled target cells using standard techniques. Lysis of target cells sensitized by MHC 
loading of peptides corresponding to minigene-encoded epitopes demonstrates DNA 

20 vaccine function for in vivo induction of CTLs. 

[ 1 07] Transgenic animals of appropriate haplotypes may additionally provide a 
useful tools in optimizing the in vivo immunogenicity of minigene DNA. In addition, 
animals such as monkeys having conserved HLA molecules with cross reactivity to CTL 
epitopes recognized by human MHC molecules can be used to determine human 

25 immunogenicity of CTL epitopes (Bertoni, et al, 1 Immunol 161 :4447-4455 (1998)). 

[108] Such in vivo studies are required to address the variables crucial for 
vaccine development, which are not easily evaluated by in vitro assays, such as route of 
administration, vaccine formulation, tissue biodistribution, and involvement of primary 
and secondary lymphoid organs. Because of its simplicity and flexibility, HLA 

30 transgenic mice represent an attractive alternative, at least for initial vaccine development 
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studies, compared to more cumbersome and expensive studies in higher animal species, 
such as nonhuman primates. 

[109] Antigenic peptides can be used to elicit CTL ex vivo, as well. The 
resulting CTL, can be used to treat chronic infections (e.g., viral or bacterial) or tumors in 
5 patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 
(infectious agent or tumor antigen) are induced by incubating in tissue culture the 
patient's CTL precursor cells (CTLp) together with a source of antigen-presenting cells 
(APC) and the appropriate immunogenic peptide. After an appropriate incubation time 
10 (typically 1-4 weeks), in which the CTLp are activated and mature and expand into 
effector CTL, the cells are infused back into the patient, where they will destroy their 
specific target cell (an infected cell or a tumor cell). 
J: [110] The peptides may also find use as diagnostic reagents. For example, a 

B peptide of the invention may be used to determine the susceptibility of a particular 

n 15 individual to a treatment regimen which employs the peptide or related peptides, and thus 
^ may be helpful in modifying an existing treatment protocol or in determining a prognosis 

n for an affected individual. 

=5 [ 1 1 1 ] For example, a peptide of the invention may be used in a tetramer staining 

;-J assay to assess peripheral blood mononuclear cells for the presence of antigen-specific 

y 20 CTLs following exposure to a pathogen or immunogen. The HLA-tetrameric complex is 
J used to directly visualize antigen-specific CTLs (see, e.g., Ogg, et al Science 279:2103- 

2106, 1998; and Altaian, et al Science 174:94-96, 1996) and determine the frequency of 
the antigen-specific CTL population in a sample of peripheral blood mononuclear cells. 
A tetramer reagent using a peptide of the invention may be generated as follows: A 
25 peptide that binds to an allele-specific HLA molecules, or supertype molecules, is 

refolded in the presence of the corresponding HLA heavy chain and p 2 -microglobulin to 
generate a trimolecular complex. The complex is biotinylated at the carboxyl terminal 
end of the heavy chain at a site that was previously engineered into the protein. Tetramer 
formation is then induced by the addition of streptavidin. By means of fluorescently 
30 labeled streptavidin, the tetramer can be used to stain antigen-specific cells. The cells 
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may then be identified, for example, by flow cytometry. Such an analysis may be used 
for diagnostic or prognostic purposes. 

[112] In addition, the peptides may also be used to predict which individuals 
will be at substantial risk for developing chronic infection. 
5 [1 13] The present application is related to U.S. Serial No. 08/589,108, filed 

1/23/96 and now abandoned, and to U.S. Serial No. 08/205,713 filed 3/4/94, which is a 
continuation-in-part of U.S. Serial No. 08/159,184, filed 1 1/29/93 and now abandoned, 
which is a continuation-in-part of U.S. Serial No. 08/073,205 filed 6/4/93 and now 
abandoned, which is a continuation-in-part of U.S. Serial No. 08/027,146 filed 3/5/93 and 
10 now abandoned. The application is also related to U.S. Serial No. 60/013,980 filed 
3/21/96 and now abandoned, U.S. Serial No. 08/454,033 filed 5/26/95, U.S. Serial No. 
08/349,177 filed 12/2/94, and U.S. Serial No. 08/753,622 filed 1//27/96 and now 
abandoned. Each of the above-referenced applications is incorporated herein by 
reference. 



15 EXAMPLES 

Example 1: Peptides 

[114] Peptides utilized were synthesized as previously described by Ruppert, J. 
et aL, "Prominent Role of Secondary Anchor Residues in Peptide Binding to HLA-A2.1 
Molecules," Cell 74:929-937 (1993) or purchased as crude material from Chiron 

20 Mimotopes (Chiron Corp., Australia). Synthesized peptides were typically purified to 
>95% homogeneity by reverse phase HPLC. Purity of synthesized peptides was 
determined using analytical reverse-phase HPLC and amino acid analysis, sequencing, 
and/or mass spectrometry. Lyophilized peptides were resuspended at 4-20 mg/ml in 
100% DMSO, then diluted to required concentrations in PBS +0.05% (v/v) NP40 (Fluka 

25 Biochemika, Buchs, Switzerland). 
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Example 2: MHC Purification 

[115] The EBV transformed cell lines JY (A*0201), M7B (A*0202), FUN 
(A*0203), DAH (A*0205), CLA (A*0206), KNE (A*0207), AP (A*0207), and AMAI 
(A*6802) were used as the primary source of MHC molecules. Single MHC allele 
5 transfected 72 1 .22 1 lines were also used as sources of A*0202 and A*0207. Cells were 
maintained in vitro by culture in RPMI 1640 medium (Flow Laboratories, McLean, VA), 
supplemented with 2 mM L-glutamine (GIBCO, Grand Island, NY), 100 U (100 ng/ml) 
penicillin-streptomycin solution (GIBCO), and 10% heat-inactivated FCS (Hazelton 
Biologies). Large scale cultures were maintained in roller bottles. 

10 [116] HLA molecules were purified from cell lysates (Sidney, J., et aL, "The 

Measurement of MHC/Peptide Interactions by Gel Infiltration," Curr Prot Immunol 
18.3.1-18.3.19 (1998)). Briefly, cells were lysed at a concentration of 10 s cells/ml in 50 
mM Tris-HCL, pH 8.5, containing 1% (v/v) NP-40 150 mM NaCl, 5 mM EDTA, and 2 
mM PMSF. Lysates were then passed through 0.45 \jM filters, cleared of nuclei and 

15 debris by centrifugation at 10,000 x g for 20 minutes and MHC molecules purified by 
monoclonal antibody-based affinity chromatography. 

[117] For affinity purification, columns of inactivated Sepharose CL4B and 
Protein A Sepharose were used as pre-columns. Class I molecules were captured by 
repeated passage over Protein A Sepharose beads conjugated with the anti-HLA (A, B, 

20 C) antibody W6/32 (Sidney, J., et a/., supra), HLA-A molecules were further purified 
from HLA-B and -C molecules by passage over a B 1.23. 2 column. After 2 to 4 passages 
the W6/32 column was washed with 10-column volumes of 10 mM Tris-HCL, pH8.0 
with 1% (v/v) NP-40, 2-column volumes of PBS, and 2-column volumes of PBS 
containing 0.4% (w/v) n-octylglucoside. Class I molecules were eluted with 50 mM 

25 dimethylamine in 0.15 M NaCl containing 0.4% (w/v) n-octylglucoside, pH 1 1.5.A 1/26 
volume of 2.0 M Tris, pH 6.8, was added to the eluate to reduce the pH to -8.0. The 
eluate was then concentrated by centrifugation in Centriprep 30 concentrators at 2000 
rpm (Amicon, Beverly, MA). Protein purity, concentration, and effectiveness of 
depletion steps were monitored by SDS-PAGE and BCA assay. 
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Example 3: MHC-Peptide Binding Assays 

[118] Quantitative assays to measure the binding of peptides to soluble Class I 
molecules are based on the inhibition of binding of a radiolabeled standard peptide. These 
assays were performed as previously described (Sidney, J., et ah, supra.). Briefly, 1-10 
5 nM of radiolabeled peptide was co-incubated at room temperature with 1 to 1 nM of 
purified MHC in the presence of 1 juM human p 2 -microglubulin (Scripps Laboratories, 
San Diego, CA) and a cocktail of protease inhibitors. Following a two day incubation, 
the percent of MHC bound radioactivity was determined by size exclusion gel filtration 
chromatography using a TSK 2000 column. Alternatively, the percent of MHC bound 
10 radioactivity was determined by capturing MHC/peptide complexes on W6/32 antibody 
coated plates, and determining bound cpm using the TopCount microscintillation counter 
(Packard Instrument Co., Meriden, CT) (Southwood, et a/., Epimmune Technical Report 
Epi 063-99). 

[119] The radio labeled standard peptide utilized for the A*020 1 , A*0202, 
15 A*0203, A*0205, A*0206, and A*0207 assays was an F 6 > Y analog of the HBV core 
18-27 epitope (sequence FLPSDYFPSV). The average IC 5 o of this peptide for each 
molecule was 5.0, 4.3, 10, 4.3, 3.7, and 23 nM, respectively. A C 4 > A analog of HBV 
pol 646 (sequence FTQAGYPAL), or MAGE 1 282 (sequence YVDCVS ARV), was 
utilized as the label for the A*6802 assay. Their IC 50 s for A*6802 were 40 and 8 nM, 
20 respectively. 

[120] In the case of competitive assays, the concentration of peptide yielding 
50% inhibition of the binding of the radiolabeled peptide was calculated. Peptides were 
initially tested at one or two high doses. The IC 50 of peptides yielding positive inhibition 
were then determined in subsequent experiments, in which two to six further dilutions 
25 were tested. Under the conditions utilized, where [label]<[MHC] and IC 50 > [MHC], the 
measured IC50 values are reasonable approximations of the true Kd values. Each 
competitor peptide was tested in two to four independent experiments. As a positive 
control, the unlabeled version of the radiolabeled probe was also tested in each 
experiment. 

30 
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Example 4: Alternative Binding Assay 

[121] Epstein-Barr virus (EB V)-transformed homozygous cell lines, fibroblasts, 
CIR, or 721.22 transfectants were used as sources of HLA class I molecules. These cells 
were maintained in vitro by culture in RPMI 1640 medium supplemented with 2mM L- 
5 glutamine (GIBCO, Grand Island, NY), 50jiM 2-ME, lOOjag/ml of streptomycin, 

lOOU/ml of penicillin (Irvine Scientific) and 10% heat-inactivated FCS (Irvine Scientific, 
Santa Ana, CA). Cells were grown in 225-cm 2 tissue culture flasks or, for large-scale 
cultures, in roller bottle apparatuses. Cells were harvested by centrifugation at 1500 
RPM using an IEC-CRU5000 centrifuge with a 259 rotor and washed three times with 

1 0 phosphate-buffered saline (PBS)(0.0 1 M P0 4 , 0. 1 54 M NaCl, pH 7.2). 

[ 1 22] Cells were pelleted and stored at -70°C or treated with detergent lysing 
solution to prepare detergent lysates. Cell lysates were prepared by the addition of stock 
detergent solution [1% NP-40 (Sigma) or Renex 30 (Accurate Chem. Sci. Corp., 
Westbury, NY 1 1590), 150 mM NaCl, 50 mM Tris, pH 8.0] to the cell pellets (previously 

1 5 counted) at a ratio of 50-1 00 x 1 0 6 cells per ml detergent solution. A cocktail of protease 
inhibitors was added to the premeasured volume of stock detergent solution immediately 
prior to the addition to the cell pellet. Addition of the protease inhibitor cocktail 
produced final concentrations of the following: phenylmethylsulfonyl fluoride (PMSF), 
2 mM; aprotinin, 5 /xg/ml; leupeptin, 10 fig/ml; pepstatin, 10 /ig/ml; iodoacetamide, 100 

20 fM; and EDTA, 3 ng/ml. Cell lysis was allowed to proceed at 4°C for 1 hour with 
periodic mixing. Routinely 5-10 x 10 9 cells were lysed in 50-100 ml of detergent 
solution. The lysate was clarified by centrifugation at 15,000 x g for 30 minutes at 4°C 
and subsequent passage of the supernatant fraction through a 0.2 \i filter unit (Nalgene). 
[123] The HLA-A antigen purification was achieved using affinity columns 

25 prepared with mAb-conjugated Sepharose beads. For antibody production, cells were 
grown in RPMI with 10% FBS in large tissue culture flasks (Corning 25160-225). 
Antibodies were purified from clarified tissue culture medium by ammonium sulfate 
fractionation followed by affinity chromatography on protein- A-Sepharose (Sigma). 
Briefly, saturated ammonium sulfate was added slowly with stirring to the tissue culture 

30 supernatant to 45% (volume to volume) overnight at 4°C to precipitate the 
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immunoglobulins. The precipitated proteins were harvested by centrifugation at 10,000 x 
g for 30 minutes. The precipitate was then dissolved in a minimum volume of PBS and 
transferred to dialysis tubing (Spectro/Por 2, Mol. wt. cutoff 12,000-14,000, Spectum 
Medical Ind.). Dialysis was against PBS (>20 times the protein solution volume) with 4- 
5 6 changes of dialysis buffer over a 24-48 hour period at 4°C. The dialyzed protein 
solution was clarified by centrifugation (10,000 x g for 30 minutes) and the pH of the 
solution adjusted to pH 8.0 with IN NaOH. Protein- A-Sepharose (Sigma) was hydrated 
according to the manufacturer's instructions, and a protein- A-Sepharose column was 
prepared. A column of 10 ml bed volume typically binds 50-100 mg of mouse IgG. 
10 [ 1 24] The protein sample was loaded onto the protein- A-Sepharose column 

using a peristaltic pump for large loading volumes or by gravity for smaller volumes 
(<100 ml). The column was washed with several volumes of PBS, and the eluate was 
Z monitored at A280 in a spectrophotometer until base line was reached. The bound 

2 antibody was eluted using 0.1 M citric acid at suitable pH (adjusted to the appropriate pH 
fl 1 5 with IN NaOH). For mouse IgG-1 pH 6.5 was used for IgG2a pH 4.5 was used and for 
% IgG2b and IgG3 pH 3.0 was used. 2 M Tris base was used to neutralize the eluate. 

0 Fractions containing the antibody (monitored by A280) were pooled, dialyzed against 

3 PBS and further concentrated using an Amicon Stirred Cell system (Amicon Model 8050 
:=? with YM30 membrane). The anti-A2 mAb, BB7.2, was useful for affinity purification. 

U 20 [125] The HLA-A antigen was purified using affinity columns prepared with 

1 mAb-conjugated Sepharose beads. The affinity columns were prepared by incubating 
protein-A-Sepharose beads (Sigma) with affinity-purified mAb as described above. Five 
to 1 0 mg of mAb per ml of bead is the preferred ratio. The mAb bound beads were 
washed with borate buffer (borate buffer: 100 mM sodium tetraborate, 154 mM NaCl, 

25 pH 8.2) until the washes show A280 at based line. Dimethyl pimelimidate (20 mM) in 
200 mM triethanolamine was added to covalently crosslink the bound mAb to the 
protein-A-Sepharose (Schneider, et al 9 1 Biol Chem. 257:10766 (1982). After 
incubation for 45 minutes at room temperature on a rotator, the excess crosslinking 
reagent was removed by washing the beads twice with 10-20 ml of 20 mM ethanolamine, 

30 pH 8.2. Between each one the slurry was placed on a rotator for 5 minutes at room 
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temperature. The beads were washed with borate buffer and with PBS plus 0.02% 
sodium azide. 

[126] The cell lysate (5-10 x 10 9 cell equivalents) was then slowly passed over a 
5-10 ml affinity column (flow rate of 0.1-0.25 ml per minute) to allow the binding of the 
5 antigen to the immobilized antibody. After the lysate was allowed to pass through the 
column, the column was washed sequentially with 20 column volumes of detergent stock 
solution plus 0.1% sodium dodecyl sulfate, 20 column volumes of 0.5 M NaCl, 20 mM 
Tris, pH 8.0, and 10 column volumes of 20 mM Tris, pH 8.0. The HLA-A antigen bound 
to the mAb was eluted with a basic buffer solution (50 mM dimethylamine in water). As 

10 an alternative, acid solutions such as 0.15-0.25 M acetic acid were also used to elute the 
bound antigen. An aliquot of the eluate (1/50) was removed for protein quantification 
using either a colorimetric assay (BCA assay, Pierce) or by SDS-PAGE, or both. SDS- 
PAGE analysis was performed as described by Laemmli (Laemmli, U.K., Nature 
221 (1970)) using known amounts of bovine serum albumin (Sigma) as a protein 

15 standard. Allele specific antibodies were used to purify the specific MHC molecule. In 
the case of HLA-A2, the mAb BB7.2 was used. 

[127] A detailed description of the protocol utilized to measure the binding of 
peptides to Class I HLA molecules has been published (Sette, et al 9 Mol Immunol 
31:813, 1994; Sidney, et aL, in Current Protocols in Immunology, Margulies, Ed., John 

20 Wiley & Sons, New York, Section 18.3, 1998). Briefly, purified MHC molecules (5 to 
500nM) were incubated with various unlabeled peptide inhibitors and l-10nM l25 I- 
radiolabeled probe peptides for 48h in PBS containing 0.05% Nonidet P-40 (NP40) (or 
20% w/v digitonin for H-2 IA assays) in the presence of a protease inhibitor cocktail. 
The final concentrations of protease inhibitors (each from CalBioChem, La Jolla, C A) 

25 were 1 mM PMSF, 1.3 nM 1.10 phenanthroline, 73 jiM pepstatin A, 8mM EDTA, 6mM 
N-ethylmaleimide, and 200 jiM N alpha-p-tosyl-L-lysine chloromethyl ketone (TLCK). 
All assays were performed at pH 7.0. 

[128] Following incubation, MHC-peptide complexes were separated from free 
peptide by gel filtration on 7.8 mm x 15 cm TSK200 columns (TosoHaas 16215, 

30 Montgomeryville, PA), eluted at 1.2 mls/min with PBS pH 6.5 containing 0.5% NP40 
and 0.1% NaN 3 . The eluate from the TSK columns was passed through a Beckman 170 



radioisotope detector, and radioactivity was plotted and integrated using a Hewlett- 
Packard 3396A integrator, and the fraction of peptide bound was determined. 

[ 1 29] Radiolabeled peptides were iodinated using the chloramine-T method. A 
specific radiolabeled probe peptide was utilized in each assay. Typically, in preliminary 
5 experiments, each MHC preparation was titered in the presence of fixed amounts of 

radiolabeled peptides to determine the concentration of HLA molecules necessary to bind 
10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were performed using these HLA concentrations. 

[ 1 30] Since under these conditions [label]<[HLA] and IC 50 >[HLA], the 

10 measured IC50 values are reasonable approximations of the true K D values. Peptide 

inhibitors are typically tested at concentrations ranging from 120 jag/ml to 1.2 ng/ml, and 
are tested in two to four completely independent experiments. To allow comparison of 
the data obtained in different experiments, a relative binding figure is calculated for each 
peptide by dividing the IC50 of a positive control for inhibition, i.e. the reference peptide 

1 5 that is included in every binding assay, by the IC 50 for each tested peptide (typically 

unlabeled versions of the radiolabeled probe peptide). For database purposes, and inter- 
experiment comparisons, relative binding values are compiled. These values can 
subsequently be converted into normalized IC50 nM values by dividing the standard 
historical IC50 of the reference peptide by the relative binding of the peptide of interest. 

20 This method of data compilation has proven to be the most accurate and consistent for 
comparing peptides that have been tested on different days, or with different lots of 
purified MHC. 

[131] For example, the standard reference peptide (or positive control) for the 
HLA-A2.1 binding assays described herein is the peptide having a sequence of 

25 FLPSDYFPS V, which has an average historical IC50 value of 5 nM in multiple, repeated 
binding assays. This standard value is used to normalize reported IC 50 values for HLA- 
A2.1 binding as described herein. Thus, a relative binding value of a test HLA-A2.1 
motif-bearing peptide can be converted into a normalized IC 50 by dividing the standard 
reference IC 50 value, i.e. 5 nM, by the relative binding value of the test HLA-A2.1 motif- 

30 bearing peptide. 
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Example 5: Sequence and Binding Analysis 

[132] Using the assay described in Example 3, a relative binding value was 
calculated for each peptide by dividing the IC 5 o of the positive control for inhibition by 
the IC 50 for each tested peptide. These values can subsequently be converted back into 
5 IC50 nM values by dividing the IC50 nM of the positive controls for inhibition by the 

relative binding of the peptide of interest. This method of data compilation has proved to 
be accurate and consistent for comparing peptides that have been tested on different days 
or with different lots of purified MHC. Standardized relative binding values also allow 
the calculation a geometric mean, or average relative binding value (ARB), for all 

10 peptides with a particular characteristic (Ruppert, J., et al, "Prominent Role of Secondary 
Anchor Residues in Peptide Binding to HLA-A2.1 Molecules," Cell 74:929-937 (1993); 
Sidney, J., et al, "Definition of an HLA-A3-Like Supermotif Demonstrates the 
Overlapping Peptide Binding Repertoires of Common HLA Molecules," Hum Immunol. 
45:79-93 (1996); Sidney, J., et al, "Specificity and Degeneracy in Peptide Binding to 

15 HLA-B7-Like Class I Molecules," J. Immunol. 157:3480-3490 (1996); Kondo, A., et al, 
"Prominent Roles of Secondary Anchor Residues in Peptide Binding to HLA-A24 
Human Class I Molecules,"/. Immunol. 155:4307-4312 (1995); Kondo, A., et al, 'Two 
Distinct HLA-A*0101-Specific Submotifs Illustrate Alternative Peptide Binding Modes," 
Immunogenetics 45:249-258 (1997); Gulukota, K., et al, "Two Complementary Methods 

20 for Predicting Peptides Binding Major Histocompatibility Complex Molecules," J. Mol. 
Biol 267:1258-1267 (1997); Southwood, S., et al, "Several Common HLA-DR Types 
Share Largely Overlapping Peptide Binding Repertoires," J. Immunol 160:3363-3373 
(1998)). 

[1 33] Maps of secondary interactions influencing peptide binding to HLA-A2 
25 supertype molecules based on ARB were derived as previously described (Ruppert, J. et 
al, "Prominent Role of Secondary Anchor Residues in Peptide Binding to HLA-A2. 1 
Molecules," Cell 74:929-937 (1993); Sidney, J., et al, "Definition of an HLA-A3-Like 
Supermotif Demonstrates the Overlapping Peptide Binding Repertoires of Common HLA 
Molecules " Hum Immunol 45:79-93 (1996); Sidney, J., et al, "Specificity and 
30 Degeneracy in Peptide Binding to HLA-B7-Like Class I Molecules," J. Immunol. 
157:3480-3490 (1996); Kondo, A., et al, "Prominent Roles of Secondary Anchor 



Residues in Peptide Binding to HLA-A24 Human Class I Molecules," J, Immunol 
155:4307-4312 (1995); Kondo, A., et al, "Two Distinct HLA-A*0 101 -Specific 
Submotifs Illustrate Alternative Peptide Binding Modes," Immunogenetics 45:249-258 
(1997); Gulukota, K., et al, "Two Complementary Methods for Predicting Peptides 
5 Binding Major Histocompatibility Complex Molecules," 1 Mol Biol 267:1258-1267 
(1997)). Essentially, all peptides of a given size (8, 9, 10 or 1 1 amino acids) and with at 
least one tolerated main anchor residue were selected for analysis. The binding capacity 
of peptides in each size group was analyzed by determining the ARB values for peptides 
that contain specific amino acid residues in specific positions. For determination of the 

1 0 specificity at main anchor positions ARB values were standardized relative to the ARB of 
peptides carrying the residue associated with the best binding. For secondary anchor 
determinations, ARB values were standardized relative to the ARB of the whole peptide 
set considered. That is, for example, an ARB value was determined for all 9-mer 
peptides that contain A in position 1, or F in position 7, etc. Because of the rare 

15 occurrence of certain amino acids, for some analyses residues were grouped according to 
individual chemical similarities as previously described (Ruppert, J. et al, supra; Sidney, 
J., et al, supra; Sidney, J., et al, supra; Kondo, A., et al, supra; Kondo, A., et al, supra; 
Gulukota, K., et al, supra; Southwood, S., et al, supra). 

Frequencies of HLA-A2-Supertvpe Molecules 

20 [ 1 34] To select a panel of A2-supertype molecules representative of the allelic 

forms most frequent in major ethnic groups, unpublished population typing data from D. 
Mann and M. Fernandez-Vina were utilized. These data were consistent with published 
data (Sudo, T., et al, "DNA Typing for HLA Class I Alleles: I. Subsets of HLA-A2 and 
of -A28," Hum. Immunol 33:163-173 (1992); Ellis, J.M., et al, "Frequencies of HLA-A2 

25 alleles in Five US Population Groups," Hum. Immunol 61 :334-340 (2000); Krausa, P., et 
al, "Genetic Polymorphism Within HLA-A*02: Significant Allelic Variation Revealed 
in Different Populations " Tissue Antigens 45:233-231 (1995) and Imanishi, T., et al, 
"Allele and Haplotype Frequencies for HLA and Complement Loci in Various Ethnic 
Groups" Tsuji, K., et al, (eds): HLA 1991, Proceedings of the Eleventh International 

30 Histo-Compatibility Workshop and Conference, Vol 1 ., Oxford University Press, 
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Oxford, pp. 1065-1220 (1992)), and are shown in Table 3. For the four major ethnic 
groups considered, it was apparent that seven HLA alleles represent the vast majority of 
A2 supertype alleles. Included in this group are A*0201, A*0202, A*0203, A*0205, 
A*0206, A*0207, and A*6802. Each of these alleles is present in 2% or more of the 
5 general population, and also occur with a frequency greater than 5% in at least one major 
ethnicity. Other alleles are represented with only minor frequencies of 1 .3%, or less, in 
any one major ethnic group. Furthermore, none of the minor alleles are present with a 
frequency greater than 1% in the general population. Based on these observations, 
A*0201, A*0202, A*0203, A*0205, A*0206, A*0207, and A*6802 were selected for 
10 studies defining peptide binding specificity and cross-reactivity in the A2-supertype. 

Main Anchor Positions of A2 Supertype Molecules 

[135] Previous studies indicated a largely overlapping peptide binding 
specificity for a set of Class I molecules designated as the A2-supertype. Here, the main 
peptide binding specificity of A2-supertype molecules was examined in more detail. 

1 5 Some of these results have been published previously, and are shown here only for 
reference purposes (Rupert, J., et al f supra and Sidney, J., et al., "The HLA-A*0207 
Peptide Binding Repertoire is Limited to a Subset of the A*0201 Repetoire," Hum. 
Immunol, 58:12-20 (1997)). 

[136] In a first series of studies, non-conservative lysine (K) substitutions were 

20 introduced at every position of two peptides previously noted to bind multiple A2- 
supertype molecules: 1) the HCV NS3 590 9-mer peptide (sequence YLVAYQATV), 
and 2) the HBV core 18 F 6 > Y 10-mer analog peptide (sequence FLPSDYFPSV). These 
peptides were tested for their capacity to bind A*0201, A*0202, A*0203, A*0205, 
A*0206, A*0207 and A*6802. In the case of the HCV NS3 590 peptide (Table 4a), K 

25 substitutions at position 2 and the C-terminus resulted in greater than 100-fold reduction 
in binding to each HLA molecule. Greater than 100-fold decreases in binding were also 
noted in the context of A*6802 when K was substituted in positions 1 and 5. Reductions 
in binding capacity in the 10-100-fold range were noted when substitutions were made at 
several other positions, notably positions 3 and 7. When the 10-mer HBV core 18 F 6 >Y 

30 ligand (Table 4b) was investigated, greater than 1 00-fold reductions in binding capacity 
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were again noted when the peptide was substituted at position 2 and the C-terminus. 
Significant reductions in binding were also observed following substitution at position 7. 

[137] Together, these data suggest that A2-supertype molecules bind both 9- and 
1 0-mer peptide ligands via anchor residues in position 2 and at the C-terminus. The 
5 presence of an additional primary or secondary anchor towards the middle of the peptide 
is demonstrated by the fact that the binding of both the 9-mer and 10-mer peptides was 
usually reduced by substitutions at position 7. 

Specificity of the Position 2 and C-Terminal Anchor Residues 

[138] Based on these results, the ligand specificity of A2-supertype molecules at 

1 0 position 2 and the C-terminus was analyzed using additional HCV NS3 590 and HBV 
core 18 F 6 >Y single substitution analogs, and also single substitution analogs of a poly- 
alanine peptide (peptide 953.01; sequence ALAKAAAAV). For these analyses, 
preferred amino acids for anchor residues were defined as those associated with a binding 
capacity within 10-fold of the optimal residue. Amino acids whose relative binding 

15 capacity was between 0.01 and 0.1 were defined as tolerated, and those associated with a 
binding capacity less than 0.01 were considered as non-tolerated. 

[139] At position 2 small aliphatic and hydrophobic residues were found to be 
generally tolerated, while other residues, including large polar, aromatic, and charged 
residues were typically not well tolerated (Tables 5a-c). L, I, V, and M were preferred as 

20 anchor residues in most (>80%) contexts. A, T, Q, and S were less frequently preferred 
as anchor residues, but were either preferred or tolerated in >80% of the contexts 
examined (Table 5d). None of the other amino acids examined were preferred in any 
context and only rarely tolerated. 

[140] At the C-terminus (Tables 6a-c), V was found to be the optimal residue in 

25 the context of all 3 parent peptides for A*0201, A*0206, and A*6802, and in 2 out of 3 
cases for A*0203 and A*0205. Overall, either V or L was the optimal C-terminal 
residue for each molecule, regardless of the peptide tested. The aliphatic/hydrophobic 
amino acids V, L, and I were preferred as anchor residues in >66.7% of the MHC-peptide 
contexts (Table 6d). M, A, and T were tolerated approximately 50% of the time. Other 

30 residues examined were either not accepted at all, or were tolerated only rarely. 
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A Re-Evaluation of the Peptide Binding Specificity of A*0201 



[141] The fine specificity of A*0201 binding was investigated in more detail 
using a database of over 4000 peptides between 8- and 11 -residues in length. It was 
found that over 30% of the peptides bearing L or M in position 2 bound A*0201 with 
5 affinities of 500 nM, or better (Figure la). Between 5 and 15% of the peptides bearing 
the aliphatic residues I, V, A, T, and Q bound with IC 50 s of 500 nM, or better. No other 
residue, including aromatic (F, W, and Y), charged (R, H, K, D, and E), polar (S and N) 
and small (C, G, and P) residues, was associated with IC 50 s of 500 nM, or better. 

[142] Consistent with the single substitution analysis, V was found to be the 

10 optimal A*0201 C-terminal anchor residue (Figure lb). Overall, 31.9% of the peptides 
with V at the C-tenninus were A*0201 binders. I, L, S, C, M, T and A were also 
tolerated, with 7. 1 to 28.6% of the peptides binding with an IC 50 of 500 nM, or better. 

[143] The correlation between peptide length (between 8 and 1 1 residues) and 
binding capacity was analyzed next. It was found that 27.6% of the 9-mer peptides 

15 bound with IC 5 o of 500 nM, or less, in good agreement with previous estimates (Rupert, 
J., et aL, supra) (Table 7a). 

[144] Longer peptides were also capable of binding, although somewhat less 
well; 17.8% of 10-mer, and 14.5% of the 11-mer peptides had affinities of 500 nM or 
better. Finally, it was noted that 8-mer peptides bound A*0201 only rarely, with 3.5% of 

20 the peptides having binding capacities better than 500 nM. 

[145] The A*0201 peptide binding database was further analyzed to assess the 
stringency of the A*0201 motif. As expected, peptides with preferred residues in each 
anchor position bound most frequently (48.7%), and with higher average relative binding 
capacity than other peptides in the library (Table 7b). Peptides with one preferred residue 

25 and one tolerated residue also bound relatively frequently, in the 17.6 to 28.4% range. 

Finally, peptides with at least one non-tolerated residue, or with tolerated residues at both 
main anchor positions, bound only rarely, if at all, with frequencies of binding in the 0- 
7.1% range. No significant difference was detected in terms of primary anchor 
preferences as a function of ligand size. 

30 [146] To identify secondary anchor effects, the binding capacity of peptides in 

each size group was further analyzed by determining the ARB values for peptides that 



contain a particular amino acid residues in specific, but size dependent, position. The 
resulting ARB values, by corresponding residue/position pairs, for 8-1 1-mer sequences 
are shown in Table 8a-d. Summary maps are shown in Figure 2a-d. In most positions, 
some secondary influence could be detected. The majority (55%) of the negative 
5 influences involved the presence of acidic (D and E) or basic (R, H, and K) residues. 
Proline (P) and large polar residues (Q, and N) were also frequently disruptive. While 
each particular size was associated with unique preferences, in most instances (79%) 
preferred residues were aromatic (F, W, or Y) or hydrophobic (L, I, V, or M). Most 
peptide lengths exhibited a preference for F, Y and M in position 3. Similarly, all peptide 

1 0 sizes shared a preference for aromatic or hydrophobic residues in the C-2 position. 

Several distinct preference patterns were also observed for peptides of a given size. For 
example, 8-mer peptides did not have any preference in either position 1 or position 3 for 
the hydrophobic or aromatic residues preferred by 9-, 10-, and 1 1-mer peptides. 1 1-mer 
peptides were unique in the preference for G in multiple positions throughout the middle 

15 of the peptide. 

Main Anchor Specificities of Other A2-Sur>ertvpe Molecules 

[147] In the next set of analyses, the main anchor specificities of A*0202, 
A*0203, A*0206, and A*6802, four of the most prevalent A2-supertype alleles next to 
A*0201, was assessed. Peptides in the A2-supertype binding database often reflect 

20 selection using an A*0201 -based bias, such as the selection of only A*0201 binding 
peptides, or the selection of peptides scoring high in A*0201 algorithms. As a result, in 
most cases, peptide binding data for non-A*0201 molecules is available for only peptides 
with supertype preferred and tolerated residues. Despite this limitation, a database of 
about 400 peptides was available for study. A database of sufficient size was not 

25 available to allow analysis of A*0205 and A*0207, although an analysis of the specificity 
of A*0207 has been published previously (Sidney, J., et ai t supra). 

[148] Analyses of the position 2 specificities are summarized in Figure 3a-d. In 
general, V, T, A, I, and M were tolerated in the context of each molecule. Allele specific 
preferences were also noted. In the case of A*0202 Q was the most preferred residue. 

30 Other residues (L, I, V, A, T and M) were tolerated, and were roughly equivalent, with 
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ARB in the 0.08-0.30 range. By contrast, A*0203 had a preference for L, M and Q. 
Residues V, A, I and T were associated with lower overall binding affinities. A third 
pattern was noted for A*0206, where Q, V, I, A, and T were all well tolerated with ARB 
values between 0.47 and 1.0, while L and M were less well tolerated. Finally, for 
5 A*6802 V and T were the optimal residues, with ARB >0.45. A was also preferred, but 
with a lower ARB (0.13). Significant decreases in binding were seen with I and M, 
which had ARB between 0.050 and 0.020. L and Q were not tolerated, with ARB 
O.010. 

[149] At the C-terminus, I, V, L, A, M and T were tolerated by all A2-supertype 
10 molecules tested, with ARB >0.060 (Figure 4a-d). I and V were the two residues most 
preferred by each allele; V was the optimal residue for A*0203, A*0206, and A*6802. L 
was typically the next most preferred residue. T, A, and M were usually associated with 
lower ARB values. 

[150] In conclusion, the position 2 and C-terminal anchor residues preferred or 
15 tolerated by A*0201 were also well tolerated by other A2-supertype molecules. While 
each allele had a somewhat unique pattern of preferences at position 2, the patterns of 
preferences exhibited by each allele at the C-terminus were fairly similar. 

Secondary Influences on Peptide Binding to A2-Supertvpe Molecules 

[151] The same library of peptide ligands was analyzed to determine the ligand 
20 size preferences of A*0202, A*0203, A*0206, and A*6802. We found that for each 
molecule 9-1 1 mer peptides were well tolerated, with ARB >0.36 (Table 9 a-d). For 
A*0203, A*0206, and A*6802, 9-mer peptides were optimal, but 10-mers were optimal 
in the case of A*0202. For all alleles, 8-mer peptides were much less well tolerated, with 
ARB in each case < 0.11. 
25 [152] The influence of secondary anchor residues on the capacity of peptides to 

bind A*0202, A*0203, A*0206, and A*6802 was examined next. The number of 
peptides available only allowed analysis of 9- and 10-mer ligands. The ARB values for 
9-mer and 10-mer peptides as a function of the presence of a particular residue in a 
specific position are shown in Tables 10-13, and summary maps in Figures 5-8. 
30 Deleterious effects were frequently (35%) associated with charged residues (D, E, R, H, 



or K). An additional 35% of the deleterious influences could be attributed to G or P. 
Positive influences were relatively evenly attributed to basic (R, H, K), acid (D, E), 
hydrophobic (F, W, Y, L, I, V, M) or small (A, P) residues. 

[153] While each molecule had a distinctive pattern of preferences and 
aversions, some common trends could be noted in the case of 10-mer peptides. For 
example, for all molecules Q and N were preferred in position 1, and R, H, and K were 
preferred in position 8. D, E, and G were uniformly deleterious for 10-mer peptides in 
position 3. Consensus preferences or aversions were not noted for 9-mer peptides. 

[154] In summary, the data in this section describe detailed motifs for 9- and 10- 
mer peptides binding to A*0202, A*0203, A*0206, and A*6802. Each motif is 
characterized by specific features associated with good, or poor, binding peptides. 

A Consensus A2-Supermotif 

[155] The motifs described above for A2 supertype molecules are very similar 
and largely overlapping. In this respect, a consensus motif can be identified that 
incorporates features commonly shared by the molecule-specific motifs (Figure 9). The 
consensus motif specifies the presence of hydrophobic and aliphatic residues in position 2 
of peptide ligands. At this position, V, L and M are preferred, while T, Q, A, and I are all 
tolerated. On the basis of the preference rank of each residue in the context of each A2- 
supertype molecule, V is the most preferred residue. At the C-terminus the consensus 
motif specifies the presence of hydrophobic and aliphatic residues L, I, V, M, A, and T. 
V is most frequently the optimal residue, while L and I are also considered preferred, 
typically being the next most optimal residues. M, A, and T are considered as tolerated 
residues. 

[156] The secondary anchor maps for A*0201, A*0202, A*0203, A*0206, and 
A*6802 were utilized to derive a supertype consensus secondary anchor motif for 9- and 
10-mer peptides (Figure 9). Residues considered as preferred for 3 or more A2-supertype 
molecules, without being deleterious for any molecule, were considered as preferred for 
the supertype consensus motif. Conversely, residues identified as deleterious for 3 or 
more molecules were designated as deleterious in the consensus motif. The consensus 
motif overlaps significantly with the detailed A*0201 motif, and includes a preference for 
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aromatic residues in position 1 and/or 3, and a shared aversion for charged residues in 
position 3. 

Correlation Between A*0201 Binding Affinity and A2-Supertvpe Cross-Reactivity 

[157] How well A*0201 binders also bound to other A2-supertype molecules 
5 was assessed next. It was found that peptides that bound A*0201 with good affinity (IC 50 
<500 nM) frequently bound other A2-supertype molecules (Table 14a). Between 36.1 
and 73.6% of A* 0201 binding peptides bound other A2-supertype molecules. Analysis 
of A2-supertype degeneracy as a function of A*0201 affinity also yielded interesting 
results. 72.8% of the peptides that bound A*0201 with IC50 <500 nM bound 3 or more 

10 A2-supertype molecules (Table 14b). As a general rule, the higher the binding affinity of 
a peptide for A*0201, the higher the likelihood that the peptide would also bind 3 or 
more supertype molecules. Over 96% of the peptides that bound A*0201 with affinities 
of 20 nM or better also bound 3 or more A2-supertype molecules. By contrast, A2- 
supermotif peptides that did not bind A* 0201 with affinities better than 500 nM only 

15 rarely (10%) bound 3 or more A2 supermotif molecules, and never bound 4 or more 

molecules. In summary, this analysis of the cross-reactive binding of peptides to A*0201 
and other A2-supertype molecules confirms the fact that this family of HLA molecules 
recognizes similar structural features in their peptide ligands. 

Analysis 

20 [158] The results of this analysis allow for the detailed definition of the 

properties of peptides that bind to HLA-A*0201 and other A2-supertype molecules. The 
A2-supertype molecules share not only largely overlapping peptide binding specificity, 
but also significantly overlapping peptide binding repertoires. Specific features of 
peptide ligands associated with degenerate A2-supertype binding capacity were identified 

25 which provide a logical explanation for the supertype relationship. 

[159] In a previous study (6) the peptide binding specificity of A*0201 was 
analyzed, and a detailed motif, including the identification of secondary anchor features, 
was constructed. In the present analyses, performed with a 10-fold larger database, we 
confirmed that data and extended the analysis to include 8- and 1 1-mer peptides. Overall, 
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the specificity of A*0201 for 8- and 1 1-mer peptides was largely similar to that for 9- and 

10- mer peptides. For example, regardless of peptide size, the majority of negative 
influences on binding capacity were associated with the presence of charged residues in 
secondary anchor positions, while the majority of positive influences were associated 

5 with the presence of hydrophobic residues. The definition of detailed motifs for 8- and 

1 1- mer peptides should allow for a more complete identification of epitopes. 

[160] Identification of A*0201 binders has been greatly facilitated by the use of 
the algorithms based on ARB values. In the present analyses a substantially larger 
database was used than previously available, allowing for a refinement of algorithm 

1 0 coefficients. Because the newer coefficients are based on a significantly larger data set, 
they are statistically more accurate and should afford more efficient and precise 
prediction of epitopes. Indeed, recent analysis has shown that a revised A*0201 9-mer 
polynomial algorithm based on a larger data set is more accurate than both an older 
algorithm based on a small data set, and neural network prediction methodologies. 

15 [ 1 6 1 ] In addition to increasing the accuracy of epitope prediction (Rupert, J., et 

al, supra; Sidney, J., et al, supra; Kondo, A., et al, supra; Gulukota, K., et al, supra; 
Parker, K.C., et al., "Sequence Motifs Important for Peptide Binding to the Human MHC 
Class I Molecule, HLA-A2," J. Immunol. 149:3580-3587 (1992) and Milik, M., et al, 
"Application of an Artificial Neural Network to Predict Specific Class I MHC Binding 

20 Peptide Sequences," Nature (Biotech) 16:753-756 (1 998)), detailed peptide binding 

motifs defining both primary and secondary anchor positions allow for the rational design 
of optimized ligands. For example, natural sequences carrying sub-optimal residues at 
primary and/or secondary positions can be identified. The sub-optimal residues may be 
replaced with optimal anchors, generating epitopes with increased binding affinity 

25 (Sidney, J., et al, supra; Pogue, R.R., et al, "Amino-Terminal Alteration of the HLA- 
A*0201 -Restricted Human Immunodeficiency Virus Pol Peptide Increases Complex 
Stability and in Vitro Immunogenicity," Proa Natl Acad ScL 9 USA, 92:8166-8170 
(1995) and Bakker, A.B., et al, "Analogues of CTL epitopes With Improved MHC 
Class-I Binding Capacity Elicit Anti-Melanoma CTL Recognizing the Wide-Type 

30 Epitope," Int. J. Cancer, 70:302-309 (1997)). Following this type of modification, wild 
type peptides that were unable to elicit responses, or were poor immunogens, may 
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become highly immunogenic Pogue, R.R., et al, supra; Bakker, A.B., et al, supra; 
Parkhurst, M.R., "Improved Induction of Melanoma-Reactive CTL With Peptides From 
the Melanoma Antigen gplOO Modified at HLA-A*0201 -Binding Peptides," J. Immunol. 
157:2539-2548 (1996); Rosenberg, S.A., etal, "Immunologic and Therapeutic 
5 Evaluation of a Synthetic Peptide Vaccine for the Treatment of Patients With Metastatic 
Melanoma," Nature (Med) 4:321-327 (1998); Sarobe, P., et al., "Enhanced in vitro 
Potency and in vivo Immunogenicity of a CTL Epitope From Hepatitis C Virus Core 
Protein Following Amino Acid Replacement at Secondary HLA-A2.1 binding positions," 
/. Clin. Invest. 102:1239-1248 (1998) and Ahlers, J.D., et al, "Enhanced 

10 Immunogenicity of HIV-1 Vaccine Construct by Modification of the Native Peptide 

Sequence," Proc. Nat'lAcad. ScL, USA, 94:10856-10861 (1997)). The CTL induced by 
such analog peptides have been shown to be capable, in most instances, of recognizing 
target cells expressing wild type antigen sequences. This phenomenon is likely to reflect 
less stringent epitope binding requirements for target cell recognition compared to that 

1 5 needed for stimulation of naive T-cells to induce differentiation into effectors (Cho, B.K., 
et al., "Functional Differences Between Memory and Naive CD8 T Cells," Proc. Nat 'I. 
Acad. Sci. USA 96:2976-2981 (1999); Sykulev, Y., et al, "Evidence That A Single 
Peptide - MHC Complex On A Target Cell Can Elicit Acytolytic T Cell Response," 
Immunity 4:565-571 (1996)). Thus, the detailed motifs described herein will facilitate not 

20 only in the identification of naturally occurring CTL epitopes, but also in the design of 
engineered epitopes with increased binding capacity and/or immunogenic characteristics. 

[ 1 62] The peptide binding specificity for other A2-supertype molecules was also 
investigated using single substitution analog peptides and peptide libraries. In agreement 
with previous reports (del Guercio, M-F, et al, "Binding of a Peptide Antigen to Multiple 

25 HLA Alleles Allows Definition of an A2-Like Supertype," J. Immunol. 1 54:685-693 
(1995) and (Sidney, J., et al, "Practical, Biochemical and Evolutionary Implications of 
the Discovery of HLA Class I Supermotifs," Immunol Today 17:261-266 (1996)); see 
also reports filed for NIH-NIAID contract NOl-AI-45241), we found that the primary 
anchor motifs of A2-supertype molecules were remarkably similar (Sudo, T., et al, 

30 "Differences in MHC Class I Self Peptide Repertoires Among HLA-A2 Subtypes," /. 
Immunol. 155:4749-4756 (1995); Rotzschke, O., et al, 'Teptide Motifs of Closely 



Related HLA Class I Molecules Encompass Substantial Differences/' Eur J. Immunol 
22:2453-2456 (1992); and Barouch, D., et al. 9 "HLA-A2 Subtypes Are Functionally 
Distinct in Peptide Binding and Presentation," J. Exp. Med. 182:1847-1856 (1995)). The 
use of peptide libraries allowed detailed characterization of the secondary anchor 
5 preferences and aversions of each molecule. It was shown that, while each A2-supertype 
molecule had a unique specificity, a supermotif based on consensus patterns could be 
identified. Because the supermotif describes features of peptide ligands that are shared 
amongst A2-supertype molecules, it is expected to allow the efficient identification of 
highly cross-reactive peptides, and indicate appropriate strategies for anchor fixing, 

10 allowing modulation of the supertype degeneracy of peptide ligands. A further result of 
the present analysis was the derivation of coefficients that could be utilized in algorithms 
for predicting peptide binding to A*0202, A*0203, A*0206, and A*6802. 

[163] HLA A* 0201 is by far the most prevalent A2-supertype allele, both in the 
general population and within major ethnic groups. Thus, the peptide screening strategy 

15 that were utilized focuses first on the identification of A* 0201 binders. It was 

determined that over 70% of the peptides that bind to A*0201 also bind to at least 2 
additional A2-supertype molecules, and that the propensity to bind other A2-supertype 
alleles correlated with A*0201 binding affinity. 

[164] In conclusion, the data described herein provide formal demonstration of 

20 the shared peptide binding specificity of a group of HLA- A molecules designated as the 
A2-supertype. Not only do these molecules recognize similar features at primary and 
secondary anchor positions of their peptide ligands, they also share largely overlapping 
peptide binding repertoires. The demonstration that these molecules share largely 
overlapping repertoires has a significant implication for the design of potential vaccine 

25 constructs. Indeed, the concept that A2-supertype cross-reactivity at the peptide binding 
level may be of immunological relevance has been demonstrated in a number of studies, 
in both infectious disease (Khanna R., et al, "Identification of Cytotoxic T-Cell Epitopes 
Within Epstein-Barr Virus (EBV) Oncogene Latent Membrane Protein 1 (LMP1): 
Evidence for HLA A2 Supertype-Restricted Immune Recognition of EBV-Infected Cells 

30 by LMPl-Specific Cytotoxic T lymphocytes," Eur J Immunol, 28:451-458 (1998); 

Bertoletti, A., et ai 7 "Molecular Features of the Hepatitis B Virus Nucleocapsid T-Cell 



Epitope 18-27: Interaction With HLA An T-Cell Receptor," Hepatology 26:1027-1034 

(1997); Livingston, B.D., et aL, "Immunization With the HBV Core 18-27 Epitope Elicits 
CTL Responses in Humans Expressing Different HLA-A2 Supertype Molecules," Hum 
Immunol 60:1013-1017, (1999); Bertoni, R., et aL, "Human Histocompatibility 
5 Leukocyte Antigen-Binding Supermotifs Predict Broadly Cross-Reactive Cytotoxic T 
Lymphocyte Responses in Patients With Acute Hepatitis," J Clin Invest 100:503-513 
(1 997); and Doolan, D.L., et aL, "Degenerate Cytotoxic T-Cell Epitopes from P. 
falciparum Restricted by Multiple HLA-A and HLA-B Supertype Alleles," Immunity 
1:91 A 12 (1997)) and cancer (Fleischhauer, K., et aL, "Multiple HLA-A Alleles Can 

1 0 Present an Immunodominant Peptide of the Human Melanoma Antigen Melan-A/MART- 
1 To A Peptide-Specific HLA-A*0201+ Cytotoxic Cell Line," J Immunol, 157: 787-797 
(1996); Rivoltini, L., et aL, "Binding and Presentation of Peptides Derived From 
Melanoma Antigens MART-1 and Glycoprotein-100 by HLA-A2 Subtypes: Implications 
for Peptide-Based Immunotherapy " J Immunol 156:3882-3891 (1996); Kawashima, L, 

1 5 "The Multi-Epitope Approach for Immunotherapy for Cancer: Identification of Several 
CTL Epitopes from Various Tumor-Associated Antigens Expressed on Solid Epithelial 
Tumors," Hum Immunol 59:1-14 (1998)) settings. 

Example 6: Peptide Composition for Prophylactic Uses 

[165] Vaccine compositions of the present invention are used to prevent 
20 infection or treat cancer in persons. For example, a polyepitopic peptide epitope 

composition containing multiple CTL and HTL epitopes is administered to individuals at 
risk for HCV infection. The composition is provided as a single lipidated polypeptide 
that encompasses multiple epitopes. The vaccine is administered in an aqueous carrier 
comprised of Freunds Incomplete Adjuvant. The dose of peptide for the initial 
25 immunization is from about 1 to about 50,000 |ig for a 70 kg patient administered in a 

human dose volume. The initial administration of vaccine is followed by booster dosages 
at 4 weeks followed by evaluation of the magnitude of the immune response in the 
patient, by techniques that determine the presence of epitope-specific CTL populations in 
a PBMC sample. Additional booster doses are administered as required. The 
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composition is found to be both safe and efficacious as a prophylaxis against HCV 
infection. 

[166] Alternatively, the polyepitopic peptide composition can be administered as 
a nucleic acid in accordance with methodologies known in the art and disclosed herein. 

[167] The above discussion is provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
in the art and are encompassed by the appended claims. All publications, patents, and 
patent applications cited herein are hereby incorporated by reference. 
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Table 3 



Fhenotypic frequency* 



Supertype 


Allele 


Blacks 


Caucasians 






Average 


A2 


A*0201 


223 


45.6 


18.1 


37.1 


30,8 




A*68Q2 


12,7 


1.8 


0.0 


4.2 


4.7 




A*0206 


0.0 


0.4 


93 


63 


4.0 




A*0207 


0.0 


0.0 


11.0 


0.0 


2.7 




A*0205 


S3, 


1.8 


0.3 


3.0 


2.5 




A*0203 


0.0 


0.0 


8.8 


0.0 


2.2 




A*0202 


6.4 


0.0 


0.5 


1.3 


2.0 




A*6901 


0.0 


0.7 


0.3 


1.3 


0.6 




A*02tl 


0.0 


0.0 


0.0 


1.3 


0.3 




A*02i2 


0.0 


0.0 


0.3 


0.8 


0.3 




A*0213 


0.0 


0.0 


0.0 


0.4 


0.1 




A*0214 


0.0 


0.0 


0.0 


0.0 


0.0 




Total 


43.1 


48.2 


45.0 


51.9 


47.1 



♦ t 

a) Phenocypic frequencies were calculated from unpublished data provided by M. Fernandez- Vina and D. Mann. Frequencies 
greater than 5% are indicated by bold foot. 



57 



Table 4 



a. HCV NS3 590. 









Sequence 








Relative binding capacity 


i 


2 


3 


4 5 6 


7 


S 


9 


A*0201 A*0202 A*0203 A*0205 A*0206 A*6802 


Y 


L 


V 


A Y Q 


A 


T 


V 




K 


K 


K 


K 

K 

K 


K 


K 


K 


SMDI 0050 fs^^lsB-IESSSMS 

W^WM 0093 0.064 0.022 

f*£0;l?M 0.026 ^§0:^Cffi2^§^t^ 
1I1DI 0.033 gf^^^^^^S 0.060 
0.054 0.016 W^^^ IPO' 0 ^ 5 0,043 



b. HBVcore 18 F^Y. 











Sequence 












Relative binding capacity 


1 


2 


3 


4 


5 6 7 


8 


9 


10 


A*0201 


A*Q202 


A*0203 


A*0205 A*0206 A*0207 A*6802 


F 


L 


P 


s 


D Y F 


P 


S 


V 


i.o ■ 


1.0 


' 1.0 


; 1.0 V L<T ^SvJl^LOBI 


K 
















0.43 


0.75 




0.36 " '1-7.;,^.$;?^ 




K 


























K 












0.44 


" 039 . 


■ - 713 • 


V 0.27 .0-17*1 ' 








K 










0.95 : -, 


-0.82 " 




* b.6i -:' M^^^yiSg^o^s^i 










K 








0.60 \ 


\ : 0.75 " 




-pM - : p:f6 ]r§M^i^^^p 










K 








: . 0.58.--, 


0.70 




: * : b.40 " ~- 039lyZ^^^^^^ 










K 












"0.079 


0.027 












K 






. 0.25' ? 


-".0.22" 


:.h' :r 6:i 


; 0.076 > 029MM'25:f\ 0.092 














K 




:--0.14 V 


- o.ia ; 


;2(X2l 


















K 











a) Binding capacities are expressed as ratios relative to the parent peptide. Peptides whose binding capacities are within 10-fold of the best 
binder are highlighted by shading, and are considered preferred; those whose relative binding capacities are 10- 100-fold less than the best 
binder are considered tolerated. 

b) A dash ("-") indicates relative binding <0.G 1. 
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Table 5 



a. HCV NS3 590. 

Relative binding capacity 



Residue A*0201 A*0202 A»0203 A*0205 A»0206 A*6802 




K ... 

F 0.016 - 0.012 
D - - - 



b. HBV core 18 F6>Y. 



Relative blading capacity 



Residue 


A*0201 


A*0202 


A*0203 


I 


r. : 0.!3 . 


- , r 0.66 *■ 


>t : \ 0.41 - 


L 


; " w 


^0.46 " 


i.o V 


V 


0.065 


' : i.0 


0.10 


T 


0.013 


0.35 


0 025 


Q 


- 0.26 


0.049 


0.49 


F 






0.015 


D 








K 








P 









A*0205 



A*6802 




c. Poly-alanine peptide ALAKAAAAV. 



Residue 
L 
M 
V 
I 
T 
A 



Relative binding capacity 



A*020i A*0202 A*0203 A*0205 A*0206 A*6802 



0.92 



0.22 



T\ 0:43 ..V*a70 -; 0A3^^Q3&i 

0.025 ^0.75.3$ 0.091 Mfi^i^^^ 



0.013 :~#&3iS^ 0.070 



0.089 



0.075 



0.011 

0.010 
0.073 



S 
G 
P 

c 

D 
F 
K 
N 



0.031 



0.023 

0.0 a 




0.057 
0.016 
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£ Summary. 



Allele/Peptide combinations %coI«ated 



Residue 


Tested 


Preferred 


Tolerated 


% preferred 


or preferred 


V 


19 


17 


2 


89.5 


100.0 


L 


19 


16 


3 


84.2 


100.0 


I 


19 


16 


3 


84.2 


100.0 


M 


6 


• 5 


1 


83.3 


100.0 


T 


19 


14 


4 


73.7 


94.7 


A 


6 


2 


4 


33.3 


100.0 


Q 


13 


8 


3 


6U 


84.6 


s 


6 


1 


4 


16.7 


83.3 


G 


6 


0 


3 


0.0 


50.0 


F 


19 


0 


4 


0.0 


21.1 


P 


19 


0 


1 


0.0 


53 


C 


6 


0 


0 


0.0 


0.0 


K 


19 


0 


0 


0.0 


0.0 


N 


6 


0 


0 


0.0 


0.0 


D 


19 


0 


0 


0.0 


0.0 



a) Binding capacities are expressed as ratios relative to the related analog with the highest binding affinity for each 
individual molecule. Peptides whose relative binding capacities are in the 1-0.1 range are highlighted by shading, 
and are considered preferred; those whose relative binding capacities are in the 0.1-0.01 range are considered 
tolerated. A dash ("-") indicates relative binding <0,0I. 

b) Indicates the number of instances in which a given residue was associated with relative binding in the 1-0.1 range 
(preferred) or 0.1-0.01 range (tolerated). 
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Table 6 



a. HCV NS3 590. 



Relative binding capacity 

Residue A*0201 ' A*0202 A»0203 A*0205 A*0206 A*6802 




F 0.066 - 0.044 

D ... 



P 
Q 



b. HBVcore 18 F6>Y. 







Relative binding capacity 




Residue 


A*020l 


A*0202 A*0203 A*0205 A*0206 A*0207 


A*6802 


I 
V 
L 


iT-0.21 


^"-^ 0.23 *3-*'o.26 "-"T <X077 ^^0.23 


0.087 


T 
P 
D 


0.033 
0.023 


0.045 0.027 0.022 „£QiOj_-j 0.027 

0.012 0.010 




F 
K 

Q 









c. Poly-alanine peptide ALAKAAAAV. 



Residue 



Relative binding capacity 



A*6802 




M 


0.025 


£3t0.tS'^f 0.031 


0.049 


0.034 




A 


0.072 


0.077 






0.025 


S 




0.0U 








T 




0.043 








C 












F 












G 












N 












P 












R 












Y 
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d- Summary. 



Allele/Peptide combinations %tolerared 



Residue 


Tested 


Preferred 


Tolerated 


^preferred 


or preferred 


V 


19 


19 


0 


100.0 


100.0 


I 


19 


18 


i 


933 


100.0 


L 


19 


14 


5 


66.7 


100.0 


M 


6 


1 


4 


20.0 


83.3 


T 


19 


3 


9 


20.0 


63.2 


A 


6 


0 


3 


0.0 


50.0 


S 


6 


0 


I 


0.0 


16.7 


P 


19 


0 


3 


0.0 


15.8 


F 


19 


0 


2 


0.0 


10.5 


C 


6 


0 


0 


0.0 


0.0 


G 


6 


0 


0 


0.0 


0.0 


N 


6 


0 


0 


0.0 


0.0 


R 


6 


0 


0 


0.0 


0.0 


K 


13 


0 


0 


0.0 


0.0 


Y 


6 


0 


0 


0,0 


0.0 


D 


13 


0 


0 


0.0 


0.0 


Q 


13 


0 


0 


0.0 


0.0 



a) Binding capacities are expressed as ratios relative to the related analog with the highest binding affinity for each 
individual molecule. Peptides whose relative binding capacities are in the 1-0.1 range are highlighted by shading, 
and are considered preferred; those whose relative binding capacities are in the 0.1-0.01 range are considered 
tolerated. A dash TO indicates relative binding <0.01. 

b) Indicates the number of instances in which a given residue was associated with relative binding in the 1-0.1 range 
(preferred) or 0.1-0.01 range (tolerated). 
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Table 7 



a. Binding as a function of peptide size. 








Peptide 




%Binding 






length 


(n) 


peptides 


ARB 1 




8 


171 


3.5 


0.072 




9 


2066. 


27.6 


1.0 




10 


1451 


17.8 


0.27 




11 


179 


14.5 


0.20 




Total 


3867 


22.2 






b. Binding as a function of main anchor motifs. 


• 






Motif 




%Binding 




Position 2 


C-terminus 


(n) 


peptides 


ARB 


Preferred 


Preferred 


526 


48.7 


1.0 


Preferred 


Tolerated 


1446 


28.4 


0.31 


Tolerated 


Preferred 


558 


17.6 


0.098 


non-tolerated 


Preferred 


27 


0.0 


0.031 


Preferred 


non-tolerated 


66 


6.1 


0.026 


Tolerated 


Tolerated 


1337 


7.1 


0.026 


non-tolerated 


Tolerated 


46 


0.0 


0.015 


non-tolerated 


non-tolerated 


71 


0.0 


0.014 


Tolerated 


non-tolerated 


105 


0.0 


0.013 


Total 




4182 


20.7 





a) ARB values are standardized to the peptide set carrying preferred residues in both primary anchor 
positions. 
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Table 8 



. 8-mer peptides 



Position (ARB*) 




*> A f—el uf 9JJ.**r pepoJct o« u<u»B r occ»mi| &m *^ b*cto^ or pwhoja oop. ,«* 

t«JT«J- AU^uJ«!ud ulewtl ^*farei**i| eternal rcurfM it fee u »dur praam. A23 v*j»<s ibo^n wcr« 

fw*iv« «iT«u »« ftutfwr ui«<ilW by boUcd ft»L tad Mfiuvt tiToca by «»4crtn«ii ud fo«i_ 
m i hfckiJ . rrsvlju J*ienni*oi W tw preferred or toknteU udwi *r« udic 

poouou Jcnverl from the «»lpa Jaaibed 1m fifw t. T» UIo- «m of (Sc * iIkt-* Ou. ubfc u cocfficvMO fee 
prrUkiQve ilgumlu-u. the Tot 3d.-toknicU t*chor re Us a fu*« boa xt o a 03 1. w^v.Iew to 4 ICOO-foU raloetioa « 

bwJwj «p*cuy u fti0 «J«( wmodf (W^eu The tixnp grooMM: bix&f uptdty of tie ?**<d [ A*:o xM 



b. 9-mer peptides 



Residue I 


A 


I s 


C 


0 70 


D 


0 065 


E 


0065 


F 


9.1 


G 


0 34 


H 


0.63 




t.3 


K 


u 


L 


1.9 


M 


1.4 


N 


l.l 


P 


0 074 


Q 


0 33 


R 


t 6 


S 


099 


T 


0.60 


V 


0.93 


W 


0.53 


Y 


10 



Position (ARB*) 



3 


4 


5 


6 


7 


s 


1.2 


13 


1.9 


0.45 


13 


0.30 


0.57 


2.7 


1.4 


11 


0.36 


1-2 


1.2 


1.7 


0.S4 


0.52 


0 21 


0.34 


0.14 




0 31 


0.53 


0.32 


1.4 


4.4 


1.1 


H 


16 


6.8 


4.1 


0J3 


1.6 


0.69 


0.43 


0.23 


0.79 


a79 


0.83 


3.8 


0.26 


1.7 


U 


1.3 


0J6 


li 


10 




0L45 


0.14 


0.56 


057 


0.17 


a/9 


0.46 


12 


a?o 


U 


16 


19 


li 


4.6 


0.20 


a97 


13 


1.0 


0.30 


0,78 


0J2 


QJ1 


0.90 


0.47 


0-47 


0.64 


0162 


a47 


0.89 


1.6 


1.6 


1.2 


0,74 


to 


0.33 


0.62 


0.73 


0.13 


0.47 


a47 


0.17 


a/7 


0.49 


0.65 


1.2 


0.45 


0.97 


0.51 


10 


0-53 


11 


QJ9 


1.9 


0.98 


U 


1.2 


0.56 


1.7 


17 


0.75 


O.JO 


2S 


5.1 


17 


1.3 


7.6 


1.9 


4J 


0.52 


3.2 


1.0 


7.4 


1.7 



-aajto 



"aooia"3 



^000103 



«aoaicg| 



peptilo 1W 4i leut L ^cfartd 4*1 1 uloutd tokIm ttAii 

uJ mdhoJi. Mi <*«r Jaml Tar nek reridM m«iirir»t iadKalull^. At mcovUc^ —-Vrr 

axffimu for pniira*« iljtwriuw. th» nha for n^kntcd «dw icafea b« m e 0001. «|>iv«kmi to , 1000-foW 



byboU 
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c. 10-rocr peptide* 



Residue 



i 



Position (ARB*) 



A 
C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 

Q 

R 
S 
T 
V 
W 
Y 



t.3 
0 63 
0/2 

Qdl 

4.4 
13 

0.54 
1.4 
t.3 
1.9 
1.4 

033 
0 // 

0.30 
l.t 
1.7 

0.83 
1.2 

0.71 

9.0 




g|gl 




1.7 
13 
0.15 

&ZZ 

4.1 

.0.44 
0.90 
3.1 

Qdl 

3.6 
9.8 
036 
033 

a/9 

033 
0.44 
096 
1.8 
7.4 



1.6 
13 
1.4 
IS 
1.4 
2.1 
0.76 
0.67 
0.44 
13; 
1.1 
1.4 
0.66 
0.30 
0.35 
0.60 
LI 
034 
4.2 

074 



1.4 
1.3 
1.1 

0.2S 
3.2 
091 
1.2 
2.4 

0.26 
13 
033 
0.39 
0.40 
1.7 
033 
0.43 
1.6 
2.0 
3.5 
0.67 



1.1 
051 

I.I 
075 

13 
0.91 
0.42 

1.6 
039 

13 

1.7 

LI 
0.92 
048 
0.77 
033 
089 
2.2 

LI 
052 



062 
13 
039 
043 
3.0 
031 
074 
27 
0.43 
4.5 
23 
043 
036 
04t 
027 
049 
1.0 
LI 
16 
10 



L2 
26 

0.22 

0.40 

5.0 

0.67 
1.6 
13 
022 
23 
4.6 
0.33 
1.7 
0 32 
0.17 
057 
049 
1.3 
4.8 

17 



LO 

13 
035 
092 

53 

l.l 
032 
037 
0.47 

L2 
033 
079 
035 
0.70 
033 

LI 

13 

L4 

13 

io ^SotiS 



m let to g 00 1 . eqaiviicat U * lOOO-raU ndvcOM i*ba4af ouwatv u filter >wi /»^««.^— tv. . 



d. 1 I-mer peptides 



Residue 



2 'V; 



Position (ARB*) 



3 


4 


5 


6 


7 


3 


9 


10 


1.8 


17 


14 


12 


to 


0 23 


0 074 


13 


2iZ 


021 


0.93 


1.4 


1.9 


0 63 


0.79 


L4 


0.10 


12 


094 


0 30 


02! 


025 


0J3 


13 


O40 


12 


094 


0 30 


02! 


025 


0 23 


13 


64 


0.40 


16 


Qdl 


1.4 


8.8 


6.1 


0/7 


0.13 


LO 


0 30 


14 


21 


5.3 


0.76 


9.0 


042 


03S 


0/2 


0.033 


1.4 


0.51 


0!6 


0.33 


9.2 


L4 


14 


3.7 


0 37 


11 


5.5 


033 


042 


03S 


0.12 


0 033 


L4 


051 


016 


£33 


9.2 


L4 


14 


3.7 


0.37 


11 


53 


033 


9.2 


1.4 


14 


3.7 


0.37 


2.1 


5.5 


033 


0.092 


1.7 


037 


13 


Qdl 


1.6 


l.l 


02/ 


U 


033 


1.4 


Qdl 


0.35 


LI 


0033 


12 


0.092 


1.7 


037 


13 


0.19 


1.6 


LI 


02/ 


042 


033 


072 


0.088 


L4 


031 


0/6 


033 


Q/7 


02/ 


09B 


1.4 


1.9 


0.63 


0.79 


1.4 


0/7 


0.21 


093 


1.4 


1.9 


0.63 


0.79 


1.4 


9.2 


1.4 


14 


3.7 


0.87 


11 


5-5 


033 


6.1 


0.40 


16 


Qdl 


1.4 


8.8 


6.1 


Qdl 


6.1 


040 


16 


0.1! 


1.4 


8.8 


6.1 


0/7 



A 
C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 
Q 

R 
S 
T 
V 
W 
Y 




e0.2*d 



a) Apu«toT95 11 



thlbK^atp^fapioniiiwtiuMlyMi A3 pepode* Mulcul] pretend nd 
okalsiot u JcacnM L» iIm smuJi uJ acdudi, tad v% buc4 o* A* p*«paf sfdfcmiafly 

effects vcfoihcr J«aficU*r fe*. *W *r «*J U*lidxsd (tmL M« »«tae poufiw, « t^ol. »4 ^« ictani^ to b« prcTmd or 

tol«^u<bor**nt^k^bybaUf<>.c AA3 viteci ti Ac nctac pockiMa <imni bom the tntrsct 4ombcd U F jww t T«4Uc~ wof ch«*«feoi,Jtt>w»a (fe 
t*bU « <m(T K *«u for prelum dpndxu. dw vWm for to***!**! ,»a« h**, boat « «, aOQt. .prvtkat to * lOOEMbld mfea*» k c^dn. to 
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Table 9 



a. A*0202 



Peptide length 


(n) 


ARB 1 


' 8 


6 


0.050 


9 


268 


0.79 


10 


120 


1.0 


11 


16 


0.90 


Total 


410 




b. A*0203 


Peptide length 


(n) 


ARB 


8 


6 


0.11 


9 


272 


1.0 


10 


122 


0.75 


11 


16 


0.36 


Total 


416 




c. A*0206 


Peptide length 


(n) 


ARB 


8 


6 


0.066 


9 


268 


1.0 


10 


120 


0.38 


11 


16 


0.66 


Total 


410 




d. A*6802 


Peptide length 


(n) 


ARB 


8 


6 


0.071 


9 


268 


1.0 


10 


120 


0.60 


11 


16 


0.47 


Total 


410 





a) ARB values are standardized to the peptide set carrying preferred residues 
in both primary anchor positions. 
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Table 10 



a. 9-roer peptides 



Position (ARB*) 




I) A pud o f I6i 9 ■ m<r peptiha >U*rf o« MMrtUy : . ■ I m t mqtoca from ruio»i vinL h*3m*l orp.JMtaaflfkwuu.^J All peptides W 
*tWa*( I prcferm! uuJ I tokmrf trs*i« u nuii poutiou. AAS *tl>ci *how» m oinUud uJucrbol bnh« iuim*U iad mcttoli. u4 

urn owed o« the poupaf ofcieniaUj' unuWm^u (tee M- «f. «. At soWary wclm poubou *x±a corrcipc-iajf a 4 J-foU or poier 
'e»ovu« J b«4u>j c*p*oty ire «l«ud »aw*«d fast. Psiidu «fT«a *rt fenhcr itksu/W by bokkd (bet. ud atpav* d7=cu by i«d«lii<sl uk! 
iuUu*4 W*u» mcJwr ^imm ire JuJeU. u4 .leem no* to b< pretend or ttkrued iac.Vm *re .adcited by bold fb«L AAA it 

the wcNor ponuofti liemeJ from *< ixUyies dejanbed k Rjvm 3 i*d 4 To »llow «« 0 f 0* viJaes iSow» a ail* u ooemdoto for 
predKti>a »Ifonihm* (St - »I«e* for «o«-«teriied uchor ursrf mi h*v« ben id to 0.001. eq«l**leX to ■ 1000- fold roJ«cao* bu Ju»j «p*ci[y, to fdw 
o« mm fnou/p<^!t*Ici TSe j.cnfe iwnxw tadaj atp*3Cy of the p**el wu OQt iM. 



b. 10-mer peptides 



Position (ARB*) 



Residue 




3 


4 


5 


6 


7 


3 


9 




1.1 


0.31 


1.4 


3.1 


0 56 


I 4 


14 


ISmS 


aw 


3.0 


1.2 


0.95 


0 43 


1 6 


1.5 


ill! 


0 28 


12 


9.1 


3.6 


2.2 


0 0077 


1.3 




0.28 


2.1 


9.1 


3.6 


12 


00077 


1.3 




5.8 


1.3 


0.33 


13 


1.3 


1 5 


1.1 




0.093 


0.83 


1.0 


0.44 


0J2 


10 


0.59 




2.0 


0.52 


0.39 


021 


0.74 


9.9 


0.22 




a« 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 




10 


0.52 


0.39 


021 


0.74 


9.9 


022 




0.35 


0.65 


0.67 


0.60 


6.7 


0.40 


a 60 




0.S5 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 




0.33 


1.4 


0.66 


0.36 


0 26 


0.79 


0.91 




U 


0.091 


2.3 


15 


014 


1.2 


3.8 




0JS 


t.4 


0.66 


0,36 


0.26 


0.79 


0.91 




2.0 


0.52 


0.89 


021 


0.74 


9.9 


022 




o.-u 


3.0 


1.2 


0.95 


O.43 


1.6 


1.5 




0.44 


3.0 


1.2 


0.95 


0.43 


1.6 


IJ 




0.S5 


0.65 


0.67 


0.60 


6.7 


0.40 


0.60 




5.8 


1.3 


0.83 


13 


1.3 


1.5 


1.1 




5.8 


1.3 


0.83 


IS 


1.3 


1.5 


Li 





A 
C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 
Q 

R 
S 
T 
V 

w 

Y 



») Apuclof 120 LO-mcr 
preferred i»d I tulerued r«atu U the 



twinlbwteritLarpatbofnorisiawu wulyiaL AH pcp&ks !ud « taut t 



■d kirnl Vy acrt it frf fo*C Po«d>« dTeca w Srha U n a T ml Yy bcMed Can. mJ fry 1> { i T ^J .- 4 .taikriri ftft M*u ucbtv puiuou m 

4e«ribeJkrtvw«3-d* To *ilo~ mm of *m *iJm Ju*« « tUa ubk u CMffkkM Ear praGam ilioru^ ite for »MkrtudiMtemUwlMv« 
haMioOOai. eqimiaiwi tOOO-foU mI»cao« ■ ^iag cipadty. W filar »a> awltf p«p«ka. a«en{« xombcov biadwf casKiiy of chtp^t X2 
•M. 
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Table 11 



a, 9-mer peptides 



Residue i 


A 


0.95 


C 


O.-t I 


D 


0.42 


E 


0.42 


F 


33 


G 


i.\ 


H 


0-63 


X 


U 


K 


0 63 


t 


t 1 


M 


1.1 


N 


0 36 


P 


0015 


Q 


0.36 


R 


0-63 


S 




T 


o.-ti 


V 


1 I 


W 


3-3 


Y 


3-3 



Position (ARB*) 


3 


* 


5 


6 


7 


s 


4.4 


2.3 


1.2 


0.36 


43 


1.4 


0.S3 


1.4 


0.91 


0.36 


1.3 


1.7 


0.059 


0.73 


QM 


0.36 


0.56 


0.64 


0.059 


0.73 


0.28 


0.36 


0-56 


0.64 


0.71 


Ql55 


1J 


0.28 


0.075 


1.3 


1.S 


1.5 


O.S6 


1.3 


3-2 


1.2 


4,2 


0.91 


1.9 


0.71 


0.93 


0.30 


0.77 


0.S5 


0.63 


1.9 


1.2 


0.56 


4.2 


0.91 


1.9 


0.71 


0.95 


0.30 


0.77 


0.35 


0.63 


1.9 


t.2 


0.56 


0.77 


0.83 


0.63 


L9 


1.2 


0.56 


U 


0.59 


2.1 


1.3 


0.97 


1.3 


1.0 


0.55 


1.2 


1.3 


1.0 


4.4 


t.3 


0.39 


2.1 


1.3 


0.97 


1.3 


4.2 


0.91 


1.9 


0.71 


0.95 


0.30 


asi 


1.4 


0.91 


0.86 


IS 


1.7 


0.53 


1.4 


0.91 


0.36 


1.2 


1.7 


0.77 


0.85 


0.63 


1.9 


t.2 


0 56 


0.71 


0.35 


1.5 


0.28 


0.075 


1.3 


0.7 1 


0.55 


1.5 


0.23 


0075 


1.3 



"^.aooia 



£0.0010-3 



t) ApMtl afZTZ 9-incr pe?**la o* fi*t»nlly oacam>| te^ioca fror» «uiow vxii btcttfriii. eg p»*»oje« onpm *u uiJvsai. AU peptide* H*d 

JC ;eu t t pcclcrrnj I aknial re»*ia< U tAe nui uchor po«itk>« ARB »iJ»a ihow* wrsx fikiliiri u JeKjibod w ibe nv»i«i»li i«U incdutla. 
i*J irt b*xil a« ifw poapui^ afch«n»uUy liouW ruif»c* (ice « re/ 6). At KCo^Jtfy **dwr 701 kmu vml«cx corres^>o»lL»j <o i WoW o< pnin 
iacreuc L* b»Jui^ <u?4£iry « uwJicaial by acraMrf fo«c Ps«a»e tlTcaj «c fcnha iduu/ici by bokkd fo»t iad i<jaa*t effect* by a*J<r'jioi ud 
iuJjaieJ foul. .VU« utciwr pu*too« m4 raidsc* ilctennbwil to b« pfrferrwi o* ntsiicd ucbon m vdwed by txjid Tost. AR3 ai 

iNc inciter p«>i(ioiu w«re JcrvcU from Um unlysa JtscnbeJ at R^ves J i*J i T» allow u< ii»ei vSsw* l» thi* t*hl* u H^iTicion* fc* 
pral«tiv« iljomivru. th* **luei fof ao«.«okxitcJ lirdtor ra*I*r» h**« boei >d loOOQI. eqij*iJeu lo « l0C0-(oU redKEM* u opacity, to fH ta 

01 xm-tnouJ pe?oiu "Th« *v<r»$e leomctric bt*J«( opacity of the p*ad wi* 1J rM. 



b. 10-mcr peptides 



Position (ARB*) 



Residue i 


A 


2.1 


C 


0 68 


D 


0.32 


£ 


0.32 


F 


8.3 


G 


1.0 


H 


0 75 


I 


0 29, 


K 


0.75 


L 


0.29 


M 


0.29 


N 


6.0 


P 


0.019 


Q 


6.0 


R 


0.75 


S 


0 68 


T 


0.63 


V 


0 29 


W 


8.3 


Y 


8.3 



-2 -j 



<p.0T7 ^ 

"::aooio3 
^aooroTi 
y;aooio 

: xC?00l0- 

^aooia; 
^.aooio 



^-aooio 4 



L5 

0.074 
0.074 
6.4 
Oil 
3.9 
0.33 
3.9 

0.83 
0.33 
0.43 
0.90 
0.43 
3.9 
0.33 

QJI 
0.83 
6.4 
6.4 



l.t 
1.0 
3.7 
3.7 
0.66 
0.59 
1.4 
0.60 
1.4 
0.60 
0.60 
2.8 
0.091 
2.8 
1.4 
1.0 
1.0 
0.60 
0.66 
0.66 



3.8 
0.S2 
1.1 
l-l 
1.0 
0.63 
0.62 
l.l 
0.62 
1.1 
1.1 
0-75 
U 
0.75 
0.62 
0.82 
0.82 
1.1 
1.0 
1.0 



1.3 
0.69 
2-4 
2.4 
1.3 
1.0 
0.55 
0.57 
0.55 
0.57 
0.57 

1.3 
4.9 

1.3 
0.55 
0.69 
0.69 
0.57 

1.3 

1.3 



0.56 
0.69 
0.60 
0.60 
t.7 

0.33 
0.77 

3.3 

0.77 

33 
33 

QJI 
3.6 

QJI 

0.77 
0.69 
0.69 
33 
1.7 
1.7 



1.7 
22 

16 
16 
0.23 
3.8 
4.7 
0.65 
4.7 
0.65 
0.65 
0.89 
1.4 
0.89 
4.7 



0.65 
0.23 
0.23 



3.0 

1.1 

13 

2.8 

1.3 

2.6 
0.085 

0.32 
0.085 

0.32 

0J2 

0.91 

25 

0.91 
0.085 
l.l 
l.l 

0.52 
1.3 
1.3 



£00010 



prcfaT«i «a<l t CDfenial mklM d tlw mmm Mdw pongpsa. AIL, ,™ „. 

^o«oU( af <Aon«»tly unuUf (w« «.$_ t«t 4V At jomim vmtM amcemdsf a » or paw * 

^i^t-f ud mUvct JctcrmiwI u b« ptcfend or lolmBri ucAon «« •dio*4 by Toat. AJL3 vOm k A« Mcfaor po w tiow * 

XochSmI i» Ftfw» 3 **i *. ToiOaw m of*« iWs W thu »W« w ecnAacKt far ptvriienv* JjwSAim. 4i vtl»e» foe »—^,_ 

baa vrfioO 001. eq«i**low to • lOOO-feU nri«cti» a bW'vf o^«dir. to Tikn o«t m-«ou/ pcyaic*. Tbc »<tnf» je««: Wk< ap««y of ti«p«»el**« IS 
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Table 12 



a. 9-mer peptides 



Residue 



Position (ARB*) 



A 
C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 

Q 
R 
S 
T 
V 
W 
Y 




oooio i 



3 


4 


5 






3 


0.91 


1 6 


0.74 


ft 7t 


13 


053 


0.47 


1.1 


1.4 


0.75 


0.72 


1.6 


031 


1.4 


12 


1.2 


021 


0.64 


031 


1.4 


12 


1.2 


0.21 


0.64 


1.4 


OAS 


1.9 


1.6 


10 


33 


0 33 


2.4 


0.24 


034 


031 


032 


0.73 


0.47 


0.62 


0.6X 


035 


033 


4.1 


032 


036 


2.4 


0.74 


0.46 


0.73 


0.47 


0.62 


0.61 


035 


033 


4.1 


0.32 


036 


2.4 


0.74 


0.46 


4.1 


032 


036 


14 


0.74 


0.46 


0.39 


0.29 


10 


0.94 


13 


1.0 


0,47 


QJZ 


Q2Z 


OAS 


11 


1.4 


039 


0 79 


XO 


0.94 


13 


1.0 


0A3 


0.47 


0.62 


0.61 


035 


033 


0.47 


1.1 


1.4 


0.75 


0.72 


1.6 


0.47 


1.1 


1.4 


0.75 


0.72 


1.6 


4.1 


032 


0.36 


14 


0.74 


0.46 


1.4 


035 


1.9 


t.6 


10 


3.3 


1.4 


0.35 


1.9 


1.6 


10 


33 





^OJOIO- 



" A o( y>i Xce*, 5~«l oa a^ly accuroj ^ ^ Ail pcrx*^ 
JudUtcwU^fefr^l^l (okruftlrttMlu.* tb. mtuiwcbor ARB vtlu«f tbowq »err cakuLued u Jejcnbed a Ac muviali «d 
nrwthcxi, P«P«J of ch«™«l]y ««Ur e j, ref 6). At *a»i*y i«bcr ?c,(«u ccrmpoodm, lo 1 3-fcW 
Of p^tf a Sl^>( t ap* lty „ *d»^«i by r=»L tir «u « ^ufi^ ^ root ^ ^pavt t Xcc* by 
uaikrf^ ud ,ul 1M2 «i (a* **Scr pouuxu « ,h*kd- iflJ rwduc dtum*^ a be p«f«Ttd cr to-^ud arKJtac « ~< f by tokl 

u coeffic.taa for pc^Civ* algonhm*. tb. viiua far W-iofenrt «i»Jt» be« *t to 0 001 a^tkn! (a * tOCCfeU rw««ix» a 



b. 10-mer peptides 



Position (ARB*) 



Residue 1 


A 


14 


C 


0.61 


D 


0 068 


E 


0.0 fit 


F 


3.0 


G 


0.71 


H 


1.4 


I 


0.42 


K 


1.4 


L 


0.42 


M 


0.42 


N 


6.1 


P 


QJZ 


Q 


6.1 


R 


1.4 


S 


0.61 


T 


0.6t 


V 


0.42 


W 


3.0 


Y 


30 



3 


4 


5 


6 


7 


3 


9 


0.62 


1-2 


11 


035 


QJZ 


033 


5.3 


021 


0.71 


1.4 


030 


036 


1.2 


0.78 


0.099 


17 


11 


32 


1.2 


033 


4.0 


0.099 


17 


11 


3-2 


1.2 


033 


4.0 


4.1 


030 


12 


16 


13 


11 


0.45 


0.07? 


031 


0.61 


0.43 


0.71 


0.73 


0.41 


QJZ 


036 


0-66 


0.36 


0.96 


5.0 


021 


33 


0.67 


0.76 


0.90 


4.9 


0.79 


1.0 


CJZ 


036 


0.66 


036 


0.96 


5.0 


022. 


3.3 


0.67 


0.76 


0.90 


4.9 


0.79 


to 


3.8 


0.67 


0.76 


0.90 


4.9 


0.79 


1j0 


0.28 


13 


0.47 


032 


QJA 


0.20 


034 


034 


12 


037 


033 


0.26 


13 


3.6 


0.28 


1.3 


0.47 


032 


QJA 


0.20 


034 


QJZ 


036 


0.66 


036 


0.96 


5.0 


QJ5 


0.23 


0.71 


1.4 


030 


036 


12 


0.78 


0.23 


0.71 


1.4 


030 


036 


12 


073 


3.8 


0.67 


0J6 


0.90 


4.9 


0.79 


to 


4.1 


0.80 


1.2 


16 


13 


11 


0.45 


4.1 


030 


12 


16 


13 


It 


0.45 



zoioio I 




100010/ * 
vojoictJ 

^.OlflOlO^ 



Ml 



1) Ap^oH:Qt<ViTW?€pi^b*^< M aicnriUyo«^ All p*pdd« had * lotf I 

fxrfemfwdl uknrtru^itfeauMCfcrpouiou. AR3 vttuts ibm wtnakalatod u<kicnb<rf (nunal< wt amtedL ittl m tewl « A« 
tWWtfdto^y ^Ur «^(»« At wo^ 

ladtCMadbyidauMdroet. P«^m elfacu « fon&cr wteuiTMd by bokkd foot arsw** tff«cu by wkrlAod ukJ Jxbeiud i»t Mm taebor powlxm «« 
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Table 13 



a. 9-mer peptides 



Residue 
A 
C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 
Q 
R 
S 
T 
V 
W 
Y 



Position (ARB*) 




0.16 

2.2 
XI 

0 83 
0.49 
0.33 
0 16 

1.0 

1 0 
2.2 
7.6 
7.6 



^&Q52g| 0.88 
^HH 0.27 




0,98 
0.92 
0-70 
a70 

u 

2-5 
0.55 

1.3 
0J5 

U 

1J 
0.45 

0.43 

0.45 
0.55 
0.92 
0.92 
1.3 
1.4 
1.4 



0.71 
0.95 
0.23 
028 

1.3 
0.43 
0.68 

1.1 
0.63 

1-1 

l.i 
0.36 

24 
0.36 
0.63 
0.95 
0.95 

1.1 

1.8 

1.3 



QJ4 
1.7 
0.70 
0.70 
2.3 
0.53 
3.2 
0.80 
3.2 
0.80 
0.80 
0.71 
13 
0.71 
3.2 

1.7 
1.7 

aso 

13 
2.3 



3.4 
0.60 
0J6 
0J6 
1.5 
0.85 
3.2 

0.65 
3.2 
0.65 

a<sj 
a46 

0.71 

a46 

3.2 
0.60 
0.60 
0.65 
U 
U 



0,71 

0.75 
a45 
0.45 
11 
1.9 
1.5 
0.57 
U 
0.57 

0- 57 
1.8 
1.7 
1.3 

1- 5 
0.75 

a75 

0-57 
11 
11 
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b. 10-mer peptides 



Residue 
_ 

C 
D 
E 
F 
G 
H 
I 

K 
L 
M 
N 
P 
Q 

R 
S 
T 
V 

w 

Y 



^"(WOIOVj 

1 



Position (ARB*) 



3 


4 


5 


6 


7 


3 


9 


5.6 


3.5 


17 


0.69 


0.71 


1.3 


1.4 


1.4 


\A 


0 20 


0.72 


0 26 


U 


0.55 


00*2 


4.8 


43 


0.63 


0.23 


010 


1.2 


0.042 


4.8 


4.3 


0.63 


0 23 


0 10 


1.2 


17 


1.4 


1.3 


U 


4.9 


0.93 


12 


a/5 


033 


11 


0J4 


[J 




0.66 




OAS 


0.70 


0 18 


3.8 


3.1 


0.33 


t.2 


1.2 


18 


1.8 


1.7 


0.94 


0.74 


a/* 


m 


0.70 


0.18 


3.3 


3.1 


0.33 


1.2 


1.2 


13 


1.3 


1.7 


0.96 


0.74 


1.2 


t.2 


13 


1.3 


1.7 


0.96 


0.74 


022 


1.5 


0.20 


8.4 


3^ 


0.31 


1.6 


0.17 


0.045 


0.090 


060 


on 


0.96 


1.3 


021 


U 


0.20 


3.4 


3.2 


0 31 


1.6 


016 


0J5 


0.70 


013 


3.3 


3.1 


033 


1.4 


1.4 


0.20 


0.72 


0.26 


1.1 


0.55 


1.4 


1.4 


0.20 


0.72 


0.26 


1.1 


0-55 


L2 


1.2 


13 


1.3 


1.7 


0.96 


0.74 


17 


1.4 


13 


U 


4.9 


0.93 


12 


17 


1.4 


U 


U 


4.9 


0.93 


22 



050 
II 
3.2 
3.2 
1.1 
0 086 

0.73 

1.2 
0 73 

1.2 

1.2 

16 
115 

16 
0.73 

It 

2.1 

1.2 

l.I 

1.1 
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Table 14 



a. Crossreactivity between A2-supertype molecules 



A*0201 
A*0202 
A*0203 
A*0206 
A*6802 



%binders crossreacting with: 



Allele A-0201 A*0202 A*0203 A*0206 A*6802 




b. Degeneracy of A*0201 binders. 



A*0201 




A2-supertype alleles bound 


(%of peptides) 






affinity 


0 


1 


2 


3 


4 


5 


>=3 


<=20 


0.0 


0.0 


3.5 


17.5 


36.8 


42.1 


96.5 


<=100 


0.0 


3.6 


11.2 


21.4 


34.7 


29.1 


85.2 


<=500 


0.0 


7.1 


20.1 


25.1 


28.3 


19.3 


72.8 


>500 


40.0 


33.3 


16.7 


10.0 


0.0 


0.0 


10.0 
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